regular expressions library
Tagapagpost sa thread: Luna Media
Luna Media
Luna Media
Estados Unidos
Ingles papuntang Spanish
+ ...
Jun 15, 2024

My colleagues and I are trying to build a library of regular expressions to help us fix machine translation errors automatically. Before we get too deep into learning and building it, we were wondering if such a thing already exists out there. Google searches did not help locate a "library regular expressions for localization" and it would be a bit time-consuming to find examples throughout the forum, which I'm sure there are plenty. Thank you very much!

 
Cilian O'Tuama
Cilian O'Tuama  Identity Verified
Alemanya
Local time: 04:47
Aleman papuntang Ingles
+ ...
Machine versus machine? Jun 16, 2024

Maybe I've misunderstood, but it sounds like you want to improve MT by using another machine?

Automatically replace certain terms with others? Would that improve anything?

More details please.

A list of 'regular expressions" would be endless.


 
Dan Lucas
Dan Lucas  Identity Verified
United Kingdom
Local time: 03:47
Kasapi (2014)
Japanese papuntang Ingles
Handcraft them Jun 16, 2024

Luna Media wrote:
My colleagues and I are trying to build a library of regular expressions to help us fix machine translation errors automatically.

Personally, I don't think a library or any other kind of prebuilt collection will be of much use to you, because you will have to inspect each case individually. It will depend very much on the language pair, the content of the text, and so on.

Regexes are powerful tools, but there are lots of edge cases and they can be destructive if deployed carelessly or indiscriminately. I use them fairly frequently and can testify that is very easy to make mistakes that ruin large sections of your document. This potential for disaster may be the origin of the following well-known observation by Jamie Zawinski:

Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.
If you really want to do this, I would abandon the idea of a pre-built library and just start crafting one example at a time, and testing very carefully (in a sandbox) as you go. That way you will at least understand what you are doing. If you're new to regexes and I suggest looking at a tool like RegexBuddy.

Regards,
Dan


Jennifer Levey
Stepan Konev
CafeTran Trainer
Darius Sciuka
Alison Jenner
 
Luna Media
Luna Media
Estados Unidos
Ingles papuntang Spanish
+ ...
TOPIC STARTER
Thank you Jun 16, 2024

Thank you for all the suggestions and input.

 


Walang moderator na tanging nakatakda sa forum na ito.
To report site rules violations or get help, please contact site staff »


regular expressions library







Pastey
Your smart companion app

Pastey is an innovative desktop application that bridges the gap between human expertise and artificial intelligence. With intuitive keyboard shortcuts, Pastey transforms your source text into AI-powered draft translations.

Find out more »
Anycount & Translation Office 3000
Translation Office 3000

Translation Office 3000 is an advanced accounting tool for freelance translators and small agencies. TO3000 easily and seamlessly integrates with the business life of professional freelance translators.

More info »