Ssddiff

Jul 20, 2023

Diff application for semistructured data such as XML files

Semistructured data is a generic term for data that carries structure information without being tabular or very tightly restricted. XML and HTML are the most prominent examples. The term would not normally be used for database tables, for example, which do not allow nesting of entries.

While this application currently only supports XML, the algorithms should be able to process other semistructured data as well.

The approach used here is usually much slower than other well-known xmldiff applications; however, it produces better results in many “tricky” cases. You could say that other xmldiff applications try to do a syntactic diff, whereas ssddiff tries to do a semantic diff.
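
To illustrate the syntactic versus semantic distinction, here is a minimal Python sketch using only the standard library. It is not ssddiff's algorithm and does not call ssddiff; the sample documents and the helper names (`text_diff`, `canonical`, `same_tree`) are made up for this illustration. It shows two XML snippets that a line-based diff reports as changed, while a tree-aware comparison that ignores attribute order and indentation treats them as equal.

```python
import difflib
import xml.etree.ElementTree as ET

# Two hypothetical documents: same content, different attribute order and layout.
OLD = '<book id="1" lang="en"><title>Diffing</title></book>'
NEW = '<book lang="en" id="1">\n  <title>Diffing</title>\n</book>'


def text_diff(a: str, b: str) -> list[str]:
    """Line-based (syntactic) diff: compares the raw text only."""
    return list(difflib.unified_diff(a.splitlines(), b.splitlines(), lineterm=""))


def canonical(elem: ET.Element):
    """Reduce an element to a nested tuple that ignores attribute order
    and leading/trailing whitespace in element text."""
    text = (elem.text or "").strip()
    attrs = tuple(sorted(elem.attrib.items()))
    children = tuple(canonical(child) for child in elem)
    return (elem.tag, attrs, text, children)


def same_tree(a: str, b: str) -> bool:
    """Tree-aware comparison: parse both documents and compare their shapes."""
    return canonical(ET.fromstring(a)) == canonical(ET.fromstring(b))


syntactic_changes = text_diff(OLD, NEW)  # non-empty: the text differs
trees_equal = same_tree(OLD, NEW)        # True: the trees carry the same data
print(len(syntactic_changes) > 0, trees_equal)
```

A real semistructured diff goes further than this equality check, since it also has to describe how one tree was changed into the other, which is where the extra running time comes from.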



Check out these related ports:
  • Zxing-cpp - ZXing C++ Library for QR code recognition
  • Zu-hunspell - Zulu hunspell dictionaries
  • Zu-aspell - Aspell Zulu dictionary
  • Zq - Easier and faster alternative to jq
  • Zorba - General purpose C++ XQuery processor
  • Zenxml - Simple C++ XML Processing
  • Zed - Command-line tool to manage and query Zed data lakes
  • Yq - Command-line YAML and XML processor, jq wrapper for YAML/XML documents
  • Yould - Pronounceable word generator
  • Yodl - Easy to use but powerful document formatting/preparation language
  • Yi-hunspell - Yiddish hunspell dictionaries
  • Yi-aspell - Aspell Yiddish dictionary
  • Yelp-xsl - DocBook XSLT stylesheets for yelp
  • Yelp-tools - Utilities to help manage documentation for Yelp and the web
  • Ydiff - Diff readability enhancer for color terminals