Resources

A collection of papers, guides and videos surrounding the XML and JSON industry. Learn more about our products through useful customer stories and data sheets.

Customer Stories

After an acquisition by Karnov Group, XML sources were required to be compared, merged and organised. With 16,000 files to be compared and merged, and the average size of the files totalling to around 200 Kbytes, this was no easy task. That is, it wouldn’t have been an easy task without DeltaXML’s XML Compare solution.

It’s paramount that editors and reviewers are in constant sync when maintaining standards. This labour-intensive task can be hindered further when needing to find change between versions with ill-equipped tools. After testing many XML Comparison techniques, both manual and hand-coded, Norman Walsh was certain that DeltaXML were the only viable option when developing standards for W3C.

With the help from DeltaXML’s XML Compare toolkit, Wrycan delivered a solution which integrated XML Compare with their Content Base product. This allows NFPA to manage all their content in a single repository, adding automated comparisons in a product-agnostic form.

The ability to consistently identify updates and mirror changes in existing information carries added significance when working with Semcon where new product introductions are accompanied with tight deadlines. DeltaXML’s unique ability to compare structured documents with accuracy becomes a necessity in the workplace.

By using DeltaXML’s DITA Compare solution, Scribestar were able to create an intensive platform that helps those in the legal profession process and review large and complex documents.

XML Compare is used to compare the versions and present the difference reports to the reviewers and proofreaders. This process is initiated automatically eliminating human error and saving a majority of labour costs.

Papers and Articles

The shift towards XML and JSON is not merely an optional trend for modern air traffic control operations – it’s a crucial requirement that can majorly influence the efficiency, safety, and future growth of global air traffic.

Finding out what has changed in a CALS table is remarkably complicated. Additional complexity arises when authors use empty columns for layout or use column or row spans specified in unusual ways, or when applications simply do not follow the standard. Can we successfully show changes within tables?

Diff3 works surprisingly well for pretty-printed source code and similar documents. But it has frustrating limitations when used for XML or JSON or similar tree-based data formats. Can it be improved?

When ordered elements can be moved then we have something that has some common ground with orderless. This paper establishes a continuum between ordered information and orderless information and proposes that these are not as far apart as they might at first appear.

Diff3 works surprisingly well for pretty-printed source code and similar documents. But it has frustrating limitations when used for XML or JSON or similar tree-based data formats. Can it be improved?

Merging XML documents is a particularly tricky operation but is often required to consolidate or synchronise two or more independent edit paths or versions. As XML tools become more powerful, the possibility of achieving an intelligent merge of XML data sets become reality.

Software developers are familiar with merge, e.g., pulling together changes from one branch into another in a version control system. Graft is a more selective process, pulling changes from selected commits onto another branch. These two processes are often implemented in the same way, but there are important differences between the two that needs addressing.

The issue of how best to represent overlapping hierarchy in XML has been the topic of a number of papers over the years. This paper is a further contribution to this important issue but approaching the problem from a different direction.

JSON is now a widely used format for data both in web applications and more generally. However, systems and APIs that exchange JSON haven’t been able to take advantage of tracking tools. Can this be helped by processing JSON as XML?

CALS tables are used in many technical documentation standards. There are OASIS specifications for CALS tables which include a number of semantic rules to ensure table validity. This paper reports on some of our experiences with CALS table processing and validation.

A new best practice within the top business markets. This paper illustrates why in today’s economy you must oversee your document and data file management and change systems to continue the success of your company.

This paper describes a new representation for overlapping hierarchy in XML. In addition to handling variants of the structure of an XML document, this representation handles changes to text and attributes.

This paper describes how a divide-and-conquer approach might be used to handle the complex markup structure that results from recording multiple variants of an historical document in a single XML document.

This paper explores the advantages of using a generic approach to representing tracked changes in XML and the benefits to having a standard XML solution. Refers to work done for OpenDocument (ODF) standards and W3C ‘Change’ Community Group.

Branch and merge may be a software developer’s dream, but using this technique with structured documents can turn into a nightmare. The merge process is so often a manual one: cut and paste and get frustrated. Can structured XML merge turn the nightmare back into a dream?

This paper discusses different aspects of XML grammar design that should be considered when document comparison is a requirement. Presented at XML London 2013, June 15-16th, 2013.

This paper introduces a proposal of a standard mechanism for representing tracked changes in XML. It was presented at XML Prague 2013, Feb 9-10 2013.

Most corporate and institutional XML data is in constant flux. Managing change in XML environments, empowers businesses and organisations not just to record change but to use change as a powerful and valuable asset.

Sponsored by NLnet on behalf of the OpenDoc Society, this paper describes a change tracking format developed for OpenDocument. The format described provides a way of representing successive changes or edits to an XML document, typically, in one or more editing sessions.

Technical Paper

XML Change Tracking

This technical paper was written in response to the wider interest in the proposal for OpenDocument change tracking. It focuses on the use for general XML change tracking and how changes may be represented in XML markup or in Processing Instructions.

There are many different approaches to tracking document changes in XML and many different situations when finding change is vital. We review popular formats and editors to discover the advantages and disadvantages of each approach in the context of the different use cases.

White Paper

XML for Publishers

Discover how change, when harnessed and managed, can add real value to a publishing business. The widespread use of XML structured documentation systems, and the deployment of XML-aware change management tools, has made it possible to bring all of the advantages of structured information to the publishing of change.

Conference Paper

XML Pipeline Performance

This paper describes advanced methods for optimizing XML pipeline performance. It extends our original smaller study into ‘Filter Pipeline Performance’ on the saxon-help email list.

The importance of ‘standard’ document formats is growing. This paper seeks to raise awareness of the development and increased use of new open document formats, and the implications these can have for businesses.

We propose a general synchronisation grammar which can describe synchronization rule sets. For example, when handling three input files, we show that changes to elements can be described in terms of just seven possible permutations.

Using JAXP to construct pipelines of processing elements is a good idea; it allows complex problems to be decomposed into a number of simpler steps or components. However, in practice the construction of pipelines is often a difficult process for Java programmers.

Localisation of text to multiple target languages has always presented unique challenges. Although it is easy to translate a single version of a document into different languages, it is much more difficult to maintain translations over multiple versions of a document.

Although providing a high level of flexibility, XML is an ephemeral format. Unlike something like CVS which can store a history of versioned data, XML documents hold a snapshot of data. But could there be a solution?

In XML, standard tools fail to identify changes precisely, meaningfully and completely. Learn how to solve this problem intelligently, representing changes in an XML format that allows downstream processing in an XML pipeline.

As XML becomes ubiquitous so the need for powerful tools to manipulate XML data becomes more pressing. As XML tools become more powerful, so the possibility of achieving a genuine, intelligent merge of XML data sets becomes a reality.

This paper goes into some of the background to DeltaXML and the general requirements that it addresses.

Please note: DeltaXML has moved forward since this paper so it should be read as background material only.

Product Sheets and Industry Use Cases

Learn how DeltaXML is helping documentation and data specialists in the finance sector to keep tight control of change and achieve compliance more readily.

XML Compare is a complete toolkit for reliable XML diff comparison. Download the product sheet to understand how the solution helps to identify all the meaningful diffs between any two XML files.

XML Data Compare is a complete toolkit for reliable XML data comparison. Download the product sheet to understand how the solution helps to identify all the meaningful diffs between any two XML data files.

DeltaJSON is a collection of comparison tools used to help manage changing JSON data. Our sophisticated SaaS service allows access through our online app or via REST API. Download the product sheet to understand how the solution helps to identify all the meaningful diffs between your JSON files.

DeltaXML and DeltaJSON products look within the structure of your data to identify relevant differences, so you can diagnose, debug and process data efficiently.

DITA Compare is the most efficient and dependable way of finding all of the changes within your DITA content. Download the product sheet to understand how the solution helps to identify all the meaningful diffs between any two DITA files.

Industry Use Case

DeltaXML and Aerospace

Learn how DeltaXML is helping documentation specialists in the aerospace industry to transform the efficiency and accuracy of content change and merge.

XML Merge is a complete toolkit for reliably merging 3 or more XML documents or datasets into a single file. Download the product sheet to understand how the solution helps to identify all the meaningful diffs between your XML files.

DITA Merge is a complete toolkit for reliably merging 3 or more DITA documents into a single file. Download the product sheet to understand how the solution helps to identify all the meaningful diffs between your DITA files.

DocBook Compare is the most efficient and dependable way of finding all of the changes within your DocBook content. Download the product sheet to understand how the solution helps to identify all the meaningful diffs between any two DocBook files.

Learn how DeltaXML is helping documentation specialists in the pharmaceutical, medical and healthcare industries to transform the efficiency and accuracy of content change and merge.

Videos and Presentations

Whether you’re undertaking a one-time migration project, transitioning from a legacy format to an open XML standard, or routinely converting third-party content for integration into your internal CMS, Tristan explores strategies for success. From outsourcing to vendors to managing your in-house XSLT suite, we cover the diverse tools and approaches involved in the conversion process.

Whether following software schedules or setting your own pace, keeping up with change can be overwhelming. Deadlines raise questions about controlling content changes, preventing unintended modifications, and meeting regulatory requirements. This webinar shares practical tips on managing evolving content and taming potential chaos.

Publications change constantly throughout their lifecycle. More control over how change is represented speeds up and enhances your publishing process, especially when you can adapt the output of changes for different reviewers, editors or even customers.

Change is one of the dynamics of the publishing world. As structured documents transformed the world of publishing, and with the majority of those documents written in XML change tracking tools have failed to keep up.

Presented at Tekom 2020, Product Director Tristan Mitchell demonstrates how to create additional revenue around your document-based products by providing value-add to your customers.

Presented at Tekom 2020, Product Director Tristan Mitchell looks at how content changes across the whole content lifecycle and discusses how comparison tools can be used to help you keep control and maintain your sanity.

Product Demo

XML Compare Demo

Watch XML Compare in action as we take you through not only how XML Compare works but how it compares with other XML comparison techniques. This demo is a great introduction to the power behind DeltaXML’s most popular comparison toolkit.

Introduction to using XPath 3.1 Notebooks in Visual Studio Code. DeltaXML’s XPath Notebook extension adds comprehensive XPath 3.1 support to Visual Studio Code’s built-in notebooks.

Product Discussion

DeltaJSON Playlist

With line-based JSON diff tools having limited functionality, many of our customers have asked us for a comparison tool for JSON data. Watch as DeltaXML CEO, Robin La Fontaine, takes you through some quick videos of how DeltaJSON manages change.

It’s common to have data in two files that we need to merge together, two different people or two different processes have made changes. Does it matter who or what has made this change? This question might help to decide whether you need a 2-way or 3-way merge.

Learn how to install the DeltaXML oXygen Adaptor, a free facility for integrating some of our products with the oXygen XML Editor. This adaptor enables the DeltaXML comparison products to be run from within oXygen.

From checking new specs in airline manuals to making sure there are no extra zeros within pharmaceutical leaflets it’s important to know when your content changes, can you risk not knowing? For nearly as long as XML has existed DeltaXML has been developing and supporting XML comparison and merge solutions.

Other Media

Book Chapter

The XML Handbook

As production deadlines become tighter and tighter, the time available for checking changes to published data is being dramatically reduced. As more and more data is generated, exchanged and processed using XML the need to identify changes within XML files is becoming widely recognized.

The success of XML has recently renewed interest in change control on trees and semi-structured data. This comparative study tests the performance (speed and result quality) of DeltaXML, XYDiff and a number of other competing tools.