Building Multilingual Glossaries With AI Support

Intro Recently, I was requested to find a way to build multilingual glossaries leveraging translation memory content. My client had translated an entire year of educational content using professional translators in seven different languages. The challenge was bipartite: Build all the required glossaries in a relatively short timeframe. Have a final human review step to validate the terms without breaking the bank. In other words, I had to create the glossary programmatically, using a pre-built list of English terms (source language), and then look for their translations, if any, in translation memories following an “exact match” approach....

July 5, 2025 · 3 min · 632 words · Joshua D. Velásquez

Bilingual Glossary Creation Experiment With Claude 2

TL;RD Among the myriad tasks you can request a GPT model to perform, scraping or parsing will be one of the most attractive to localization engineers since you can build glossaries and translation memories out of related multilingual files. Most GPT models won’t let you scrape websites, but you can circumvent this sane limitation using a markdown clipper. Advanced Python users can use BeautifulSoup. Few-shot prompting will help the model extract the desired information....

December 9, 2023 · 5 min · 1055 words · Joshua D. Velásquez