How to detect the Korean character right before "을 " or "를 " and do some unicode value calculation of its final consonant?

sally1 · November 2, 2017, 4:39am

What I need to detect is

the character right before "을 " or "를 ",
and get the unicode value of the character,
and do the calculation like “(above unicode value-44032)%28”
and check if the result is zero(0) or not.

The final result(Zero or Non-Zero) will decide which one of “을” or “를” should be appended after the character, which is very helpful to detect common errors in Korean.

For example, from “사람를 만나다”, I need to detect “람” right before "를 ", then get the unicode value of “람”, then do the calculation “(above unicode value-44032)%28”, then check if the result is zero or not.
For this case, it should be non-zero, so I can see that "를 " is an error and should be changed to "을 ".

Thank you.

pcondal · November 2, 2017, 6:34pm

Xbench Regex in checklists does not support math formulas. To detect segments with this pattern, you would need to write a QA plugin (it requires programming skills in your team).

Topic		Replies	Views
Android Escape Characer RegEx Power Search Technical Support	3	1172	May 4, 2020
REGEXP to find one character exactly once in a segment Technical Support	2	180	September 27, 2023
Special characters mismatch General Discussion	3	1970	March 23, 2016
Straight quotes should be left as in source Technical Support	3	988	December 14, 2018
Check Spanish punctuation Technical Support	2	136	November 14, 2023

How to detect the Korean character right before "을 " or "를 " and do some unicode value calculation of its final consonant?

Related Topics