1. 05 Dec, 2008 1 commit
    • Andrew Hodgkinson's avatar
      Fix bugs and inconsistencies in encoding handlers. · 69a25816
      Andrew Hodgkinson authored
        Fix inconsistency in handling illegal byte sequences.
        Convert surrogate codepoints and U+FFFE, U+FFFF to U+FFFD.
        Also, a few extra mappings.
      Detail:
        enc_utf8.c: 0x80 is a continuation byte. Map stray ones to U+FFFD.
                    Reset the count of expected continuation bytes to 0 when
                    encountering illegal byte sequences. Previously, if the character
                    callback returned non-zero, this count would not be reset, thus
                    leaving the codec in an inconsistent state. Additionally, we no
                    longer consume the illegal continuation byte: instead, we process
                    it as a start byte next time round.
        encoding.c: Do not load extension tables for ISO-8859-{1,2,9,10,15,16}
                    If these are needed, it's probably best that different charset
                    names are used rather than overloading 8859-n.
        iso2022.c:  Permit SS2/3 escape sequences for EUC encode/decode.
                    Disable C1 chara...
      69a25816
  2. 25 Aug, 2005 1 commit
  3. 05 Oct, 2000 1 commit
    • John Beranek's avatar
      Copyright message changes + changes from branch + Unified branched/non-branched builds · b5fafb8f
      John Beranek authored
      Detail:
        Copyright messages changed from E-14 to Pace throughout, filename
         placed at top of file throughout, instead of in just some files.
      
        Merged branch's fixes into our code base, plus made it possible to
         get nice debug output in branched tree, and vfprintf() to stderr in
         RISC OS tree.  Exactly same source used in branched tree now (apart
         from OS specific files riscos.c and unix.c moving into layers
         directory structure).
      
      Admin:
        Built for branched, both Unix and RISC OS.
        Built in RISC OS tree, and compiled into TextConv.
      
      
      Version 0.45. Tagged as 'Unicode-0_45'
      b5fafb8f
  4. 12 Mar, 1999 1 commit
  5. 11 Mar, 1999 1 commit
    • Kevin Bracey's avatar
      Implemented SCSU and UTF-7. · 30550b96
      Kevin Bracey authored
      Added encoding_set_flags().
      Proper handling of byte order marks in UTF-16 and UCS-4.
      Fixed UTF-16 surrogate writing.
      Adjusted various MIME charset identifiers.
      Incorporated latest Unicode Character Database (2.1.8).
      Added "current system alphabet" encoding.
      Created "TextConv" command line character set conversion utility.
      
      Version 0.34. Tagged as 'Unicode-0_34'
      30550b96