ApSIC Xbench Forum

Fine partial matches in source/target

Is there a regex that can warn us for partial matches. We have a number identified a number of users who are copying source to target and not completing the translation. Any ideas of the type of checks we can use to identify partial source matching target? 20 or more characters matchign or something?

e.g.

[source]
The JUKE NISMO RS increases the engine’s performance to 215 horsepower (at 6,000 rpm) and 210 lb-ft of torque (from 3,600 to 4,800 rpm) for the 6-speed manual transmission-equipped front-wheel drive model.

[target]
JUKE NISMO RS کار انجن را به 215 horsepower (به 6,000 rpm) و 210 lb-ft torque (از 3,600 الی 4,800 rpm) افزایش میدهد for the 6-speed manual transmission-equipped front-wheel drive model.

1 Like

Hi Christopher,

I have tried several regex but couldn’t find any that works. The only way would be to create a plugin that would detect segments that may be partial matches.

Here is the Xbench documentation to create a plugin: https://docs.xbench.net/programmer-reference-qa-plugins/. However, it requires programming skills.

Regards,
Oscar.

Since it is relatively likely that these source leftovers will be at the end, probably the following search that captures the last 20 characters in source and checks if they are also found in target should do the trick:

  • Source: (.{20})=1$
  • Target: @1$
  • Search Mode: Regex

You may want to experiment with other lengths than 20 (or have several lengths checked in a checklist and only review the shortest one that does not have too many false positives).

1 Like

Thinking a little outside the box: you might not need a regex at all – all the partially translated segments will most likely contain a fairly high number of words flagged by the spelling checker (at least for languages where a spelling checker is available). By running a QA report that only checks spelling issues, and then sorting it by segment number, you should be able to identify most of the partially untranslated segments.