encoding   14289


Is it possible to force Excel recognize UTF-8 CSV files automatically? - Stack Overflow

A solution is to not use CSV at all. This format is implemented so stupidly by Microsoft that whether comma or semicolon is used as the separator depends on the region settings in Control Panel. So the same CSV file may open correctly on one computer but not on another. "CSV" means "Comma Separated Values", but on a German Windows, for example, semicolon must be used as the separator by default, while comma does not work. (Here it should be named SSV = Semicolon Separated Values.) CSV files cannot be interchanged between different language versions of Windows. This is an additional problem on top of the UTF-8 problem.
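
If CSV cannot be avoided, one commonly used workaround for the UTF-8 half of the problem is to write the file with a leading byte order mark, which Excel generally takes as a signal that the file is UTF-8. The sketch below is only an illustration under that assumption: the function name `write_excel_friendly_csv` and the sample rows are made up here, and the separator still has to be chosen to match the target machine's regional settings.

```python
import csv
import io


def write_excel_friendly_csv(rows, delimiter=","):
    """Return CSV content as UTF-8 bytes with a leading BOM (hypothetical helper)."""
    buffer = io.StringIO(newline="")
    writer = csv.writer(buffer, delimiter=delimiter)
    writer.writerows(rows)
    # The "utf-8-sig" codec prepends the bytes EF BB BF when encoding.
    return buffer.getvalue().encode("utf-8-sig")


# Sample data (invented for illustration); semicolon as on a German Windows.
data = write_excel_friendly_csv(
    [["name", "city"], ["Jürgen", "Köln"]],
    delimiter=";",
)
```

The resulting bytes can be written to disk with `open(path, "wb")`. The BOM only addresses encoding detection; it does nothing about the comma-versus-semicolon issue described above.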
encoding  excel 
10 days ago by danwin
Compact Disc Structure | byuu.net
A look into the data that's stored on CD-ROMs, how that's encoded into CD image files, and a proposal for a more complete CD-ROM archival format.
reference  data_structures  article  cd  compact_disc  cdrom  hardware  encoding  audio 
17 days ago by AlanRalph
The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) – Joel on Software

Ever wonder about that mysterious Content-Type tag? You know, the one you’re supposed to put in HTML and you never quite know what it should be?

Did you ever get an email from your friends in Bulgaria with the subject line “???? ?????? ??? ????”?

I’ve been dismayed to discover just how many software developers aren’t really completely up to speed on the mysterious world of character sets, encodings, Unicode, all that stuff. A couple of years ago, a beta tester for FogBUGZ was wondering whether it could handle incoming email in Japanese. Japanese? They have email in Japanese? I had no idea. When I looked closely at the commercial ActiveX control we were using to parse MIME email messages, we discovered it was doing exactly the wrong thing with character sets, so we actually had to write heroic code to undo the wrong conversion it had done and redo it correctly. When I looked into another commercial library, it, too, had a completely broken character code implementation. I corresponded with the developer of that package and he sort of thought they “couldn’t do anything about it.” Like many programmers, he just wished it would all blow over somehow.

But it won’t. When I discovered that the popular web development tool PHP has almost complete ignorance of character encoding issues, blithely using 8 bits for characters, making it darn near impossible to develop good international web applications, I thought, enough is enough.

utf  unicode  encoding  blog  programming 
18 days ago by ndf
utf 8 - Adding BOM to UTF-8 files - Stack Overflow

(Answer based on https://stackoverflow.com/a/9815107/1260896 by yingted)

To add BOMs to all the files that start with "foo-", you can use sed. The -i option edits the files in place and can keep a backup if you give it a suffix (for example -i.bak).

sed -i '1s/^\(\xef\xbb\xbf\)\?/\xef\xbb\xbf/' foo-*

If you know for sure there is no BOM already, you can simplify the command:

sed -i '1s/^/\xef\xbb\xbf/' foo-*

Make sure UTF-8 is really the encoding you need, because e.g. UTF-16 uses a different BOM (otherwise see "How can I re-add a unicode byte order marker in linux?").
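
For systems where the sed escapes above are not available, the same idea can be sketched in Python. This is not part of the original answer; the helper name `add_utf8_bom` is an assumption made for illustration. It prepends the UTF-8 BOM only when the file does not already start with one, mirroring the first sed command.

```python
import codecs
from pathlib import Path


def add_utf8_bom(path):
    """Prepend a UTF-8 BOM unless the file already has one.

    Returns True if the file was modified, False otherwise.
    (Hypothetical helper, not taken from the quoted answer.)
    """
    p = Path(path)
    data = p.read_bytes()
    if data.startswith(codecs.BOM_UTF8):
        # Already marked; leave the file untouched.
        return False
    p.write_bytes(codecs.BOM_UTF8 + data)
    return True
```

To cover every file starting with "foo-" in the current directory, one could loop with `for p in Path(".").glob("foo-*"): add_utf8_bom(p)`. Unlike `sed -i.bak`, this sketch does not keep a backup copy.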
ADMINS  project  utf8  encoding 
18 days ago by ndf
Base64 - base64.guru
Almost everything about Base64 for those who want to convert, develop, or learn the most popular binary-to-text algorithm
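
As a quick illustration of what "binary-to-text" means here, the following minimal sketch uses Python's standard `base64` module (the variable names are arbitrary). Every 3 input bytes become 4 ASCII characters, and `=` pads the output when the input length is not a multiple of 3.

```python
import base64

# Three bytes map to exactly four Base64 characters, no padding needed.
full_group = base64.b64encode(b"Man")

# Two bytes leave the last group incomplete, so one "=" is appended.
padded_group = base64.b64encode(b"Ma")

# Decoding restores the original bytes.
round_trip = base64.b64decode(full_group)
```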
encoding  data 
20 days ago by insertrealname
