Teckit

Jul 20, 2023

Toolkit for converting data between 8-bit legacy encodings and Unicode

TECkit (Text Encoding Conversion toolkit) is a toolkit for converting data between 8-bit legacy encodings and Unicode. It can also be used for transliteration of Unicode between different scripts.

TECkit uses a mapping description language that maps byte encodings to Unicode. Mapping rules can be extended by (1) the use of character sequences rather than single characters on either side; (2) the addition of contextual constraints (environments) determining when a rule should apply; and (3) the use of character classes, optional and repeatable elements, grouping and alternation to express more complex patterns to be matched and processed.
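The three kinds of rule extension can be illustrated with a minimal Python sketch. This is not TECkit's mapping language or its engine: the `Rule` class, the `convert` function, and the example byte values are all hypothetical, and regular expressions stand in for TECkit's own pattern syntax. It only shows the idea of sequence rules, contextual constraints, and repeatable elements applied in order to a byte stream.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """One hypothetical mapping rule (not TECkit syntax)."""
    pattern: str      # regex for the legacy-side text to consume
    output: str       # Unicode text to emit
    before: str = ""  # required preceding context (must be fixed-width)
    after: str = ""   # required following context


def _compile(rule):
    # Contexts become lookbehind/lookahead so they are tested, not consumed.
    parts = []
    if rule.before:
        parts.append(f"(?<={rule.before})")
    parts.append(f"(?:{rule.pattern})")
    if rule.after:
        parts.append(f"(?={rule.after})")
    return re.compile("".join(parts)), rule.output


def convert(data: bytes, rules) -> str:
    # Decode as Latin-1 so that each byte becomes exactly one character.
    text = data.decode("latin-1")
    compiled = [_compile(r) for r in rules]
    out, pos = [], 0
    while pos < len(text):
        for regex, output in compiled:
            m = regex.match(text, pos)
            if m and m.end() > pos:  # first rule that consumes input wins
                out.append(output)
                pos = m.end()
                break
        else:
            out.append(text[pos])    # no rule applies: pass through
            pos += 1
    return "".join(out)


# Hypothetical rules for an imaginary legacy encoding.
RULES = [
    Rule(pattern="a\xb4", output="\u00e1"),                # (1) sequence: a + acute byte
    Rule(pattern="s", output="\u03c2", after=r"[ .,]|$"),  # (2) context: word-final s
    Rule(pattern="s", output="\u03c3"),                    #     s elsewhere
    Rule(pattern=" +", output=" "),                        # (3) repeatable element
]

if __name__ == "__main__":
    print(convert(b"sas a\xb4s   s.", RULES))
```

Rules are tried in list order at each position, so the context-restricted rule for `s` must come before the general one. A real TECkit mapping is compiled into a binary table rather than interpreted like this.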

TECkit is particularly useful with XeTeX, a Unicode-aware derivative of TeX.

The following binaries are provided:

  • teckit_compile - mapping compiler that allows binary mapping tables (.tec) to be built from TECkit description files (.map)
  • sfconv - a tool for converting Standard Format (SF) files
  • txtconv - a utility to apply TECkit mappings to plain-text files
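A typical workflow with these tools is to compile a .map file into a .tec table and then apply it to a text file. The Python sketch below only builds the two command lines. The file names are hypothetical, and the options used (`-o` for teckit_compile; `-i`, `-o`, `-t` and `-r` for txtconv) are assumptions based on the tools' usage messages, so check them against the installed version before relying on them.

```python
import shlex
import subprocess


def compile_cmd(map_file: str, tec_file: str) -> list:
    # Assumed form: teckit_compile <mapping.map> -o <mapping.tec>
    return ["teckit_compile", map_file, "-o", tec_file]


def txtconv_cmd(tec_file: str, src: str, dst: str, reverse: bool = False) -> list:
    # Assumed form: txtconv -i <in> -o <out> -t <mapping.tec> [-r]
    cmd = ["txtconv", "-i", src, "-o", dst, "-t", tec_file]
    if reverse:
        cmd.append("-r")  # assumed: apply the mapping in the reverse direction
    return cmd


def run(commands, dry_run: bool = True) -> None:
    for cmd in commands:
        print("+", shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical file names; dry_run=True only prints the commands.
    run([
        compile_cmd("legacy.map", "legacy.tec"),
        txtconv_cmd("legacy.tec", "input.txt", "output.txt"),
    ])
```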

http://scripts.sil.org/TECkitDownloads#5b6cf869


Check out these related ports:
  • Zxing-cpp - ZXing C++ Library for QR code recognition
  • Zu-hunspell - Zulu hunspell dictionaries
  • Zu-aspell - Aspell Zulu dictionary
  • Zq - Easier and faster alternative to jq
  • Zorba - General purpose C++ XQuery processor
  • Zenxml - Simple C++ XML Processing
  • Zed - Command-line tool to manage and query Zed data lakes
  • Yq - Command-line YAML and XML processor, jq wrapper for YAML/XML documents
  • Yould - Pronounceable word generator
  • Yodl - Easy to use but powerful document formatting/preparation language
  • Yi-hunspell - Yiddish hunspell dictionaries
  • Yi-aspell - Aspell Yiddish dictionary
  • Yelp-xsl - DocBook XSLT stylesheets for yelp
  • Yelp-tools - Utilities to help manage documentation for Yelp and the web
  • Ydiff - Diff readability enhancer for color terminals