Diving Into DCP Files for XML Compare

What are the default settings in a DCP file?

An empty DCP file will have these settings by default.

  • Word by Word – Differences in all elements are resolved down to the word level
  • Whitespace Normalisation – Sequences of whitespace characters are normalised to a single space character
  • Table Processing – Elements conforming to the HTML and CALS specifications are recognised and processed specially to keep the output valid, provided the input is valid.
  • Comparison Order – The comparator matches elements based on their document order.
  • Change Gathering – When sibling elements or text in the comparison result file are a mix of added and deleted content, it is by default reordered to show all deleted content followed by all added content.

All of these settings and more can be tweaked by using special deltaxml attributes.  Full details can be found on Document Comparator Guide in the section on Customizing a comparison.

Understanding the DCP Schema File

w3schools provides a great introductory tutorial about XML Schemas. There are currently just two schema (XSD) files in XML Compare’s samples/dcp directory.   You can choose in the DCP file whether you want validation against XML schema 1.0 or 1.1 by including the appropriate file as explained in a comment at the start of the DCP file.  v1.1 has some additional constraints, improving your editing experience, and is preferred. However some editors that our customers use to change DCP files may not support schema 1.1. For example, with the oXygen XML Editor, versions from 14.2 onwards support scheme 1.1, see www.oxygenxml.com/xml_editor/whatisnew14.2.html

Choosing an Extension Point

In most cases you can put your filter in the INPUT_A_PRE_TABLE or the INPUT_B_PRE_TABLE extension points for the input pipelines or the OUTPUT_FINAL extension point for the output pipeline as required.

In more complex cases, where you want custom table processing or filters to adjust the formatting, then you need to refer to the JavaDoc or get DeltaXML support

Adding external filters to the DCP file

The Custom Filters section of the Document Comparator Guide, explains how to add XSLT filters to the pipeline, using a filter that creates a key from an xml:id attribute as an example

A Beginner’s Guide to XML Comparison

For more information about pipelines and the differences between DXP and DCP see here

For in-depth information about DXP start here.

The location of DXP/DCP files on a computer

Links to useful documentation

Keep Reading

Managing Risk in Legal Documentation

/
Proactively addressing compliance, accuracy, and security risks in legal documentation is essential to protect from costly errors.

Ensuring Accuracy in Legal Documentation

/
Efficient document comparison and merging can drastically improve accuracy, collaboration, and compliance for legal teams.

Introducing HTML Compare

/
HTML Compare is your go-to for tracking, comparing, and managing HTML content changes with ease, offering clear visual highlights and customisable settings.

Introducing Subtree Processing Mode for Greater Flexibility

A new feature that lets you control how content is compared by processing sections as either text or data.

Beyond Step-Through XSLT Debugging

Print-debugging in XSLT provides a broader view of code behaviour by capturing variable values at multiple points.

Solving Common Challenges with Inaccurate Document Management

Discover practical strategies to overcome common challenges in regulated industries.

How to avoid non-compliance when updating technical documents in regulated industries

Navigate the challenges of updating technical documents in regulated industries.

Built-in XML Comparison vs Document Management Systems (DMS)

Compare using specialised XML comparison software versus a DMS in regulated industries.

How Move Detection Improves Document Management

Learn how move detection technology improves document management by accurately tracking relocated content.