Data extraction solutions have advanced significantly in the last 2 to 3 years, allowing us to do things today that were impossible just a few months ago. One of the most notable improvements is the ability of these recent solutions to understand the context of a document.
If you missed this mini-series introduction, you can find it here: Mini-series: the real power of new data extraction solutions.. In the previous article, we discussed how to handle date formats, which you can access here: Smart Date Formatting.
This second article will explore another real-world use case: ditto marks.
What are Ditto marks?
You may be wondering what a ditto mark is, but chances are, you already know it, not by that name. A ditto mark is a double quote character that indicates that the word or value above it should be repeated. It is commonly used in handwritten documents where the precious “copy-paste” does not exist.
Let's look at an example. When listing items o