Delimiters

Structure usually comes with regular patterns. Delimiters show when a structured field begins and ends. We're so used to space as a delimiter we forget it was once an invention!

1. INNOVAFERTANIMVSMVTATASDICEREFORMASCORPORADICOEPTISNAMVOSMVTASTISETILLASADSPIRATEMEISPRIMAQVEABORIGINEMVNDIADMEAPERPETVVMDEDVCITETEMPORACARMEN
2. In nova fert animvs mvtatas dicere formas corpora; di, coeptis (nam vos mvtastis et illas) adspirate meis primaqve ab origine mvndi ad mea perpetvvm dedvcite tempora carmen![2]
3. I want to speak about bodies changed into new forms. You, gods, since you are the ones who alter these, and all other things, inspire my attempt, and spin out a continuous thread of words, from the world's first origins to my own time.

In free text, we use space for a delimiter.
In CSV, we use , for a delimiter.
In TSV, we use tab for a delimiter.
In DarwinCore we CSVs, and then use | for a delimiter within a field.

Natural Language Processing

What is natural language?

Semi-structured text

Semi-structured text (continued)

Delimiters

Using delimters

Regular Expressions

Multi-character delimiters

Mixing structure with semi-structure

Excel Regex Demo data

Excel Regex Demo solutions

Word stems

Word Stem Dictionaries