I was wondering if it is possible to use regular expressions to check splitting sentences when a translator is instructed not to split them and does it anyway. Basically, to check if there is the same number of sentences in the target as in the source.
Source segment: There is an incredibly long sentence in the source language which would be better split into more shorter sentences in the target language but for whatever reason, the translator mustn’t do so. (1 sentence)
Target segment: Short sentence one. Short sentence two. Short sentence three. (3 sentences)
The desired checker would flag this example as an error. Perhaps it would be possible to use a regexp that would check the number of periods but on the other hand that might lead to false alerts when there are ordinals in the target segment as well.