Correcting Translations that Don’t Use Unicode
Posted by: Mondele

Sometimes you will encounter an older translation project where a non-Unicode font was used to display the necessary characters for the target language. Unfortunately, while this worked at the time, it doesn’t convert well to other environments. So, we prefer to convert the files to the Unicode standard.

In this example it was pretty easy, because there were only two characters that needed to be changed. It is imaginable that we could face a much larger number of changes. Also, although the backslash character was used (`\`) because the project was in Markdown rather than USFM there was no conflict. If the project had been a book of the Bible (and used USFM) it would have been more difficult to fix.

It is best to do this work with the raw project text. There are other Help Desk articles that can help you with that if you need it.

The first thing we need to do is to compare the text we have received with the needed output. I was able to request a PDF from the local tech worker, and compare it with the text in the project.

In this case, q was used for ɛ and \ was used for ɔ.

There are a number of programs that can be used to change text in multiple files. On the Mac I use TextMate, and on Windows I use Notepad++. The needed characters in Unicode can be gotten through the Emoji & Symbols on the Mac and the charmap.exe program on Windows. You can also find the characters at https://www.unicode.org/charts/index.html.

Please note that in this case ɛ and ε are not the same character. The first is the open e from the International Phonetic Alphabet block of the Unicode standard, and the second is small epsilon, from the Greek block. We should be careful to try to pick the correct character, rather than the first one that looks right.

Once we have found the correct characters, we can use our text editor to make the changes. In Notepad++ we go to the File menu and choose Open Folder as Workspace… Then pick the top level folder in the project.

Right-click on the folder and choose Find in Files…

In the dialog that comes up, put the current character in the field marked Find what: and the desired character in Replace with: When you click on Replace in Files you will be asked to confirm the changes. It doesn’t take long for all of the changes to be made. Now, repeat the process for each of the other characters that need to be corrected.

If you would like to have these instructions for TextMate, or if you need a workflow for Linux, please contact the Help Desk.

Ocen Audio Computer Specs

Ocen Audio : which is used to edit audio files for tR projects does not have any official computer specifications for hardware at this time.  I did find this from https://tekadvisor.ca/run/ocenaudio .  This is what I received for Ocenaudio's support: Hello Mark,...

USING SYNCTHING TO SUPPORT MAST

In an earlier workflow we used USB thumb/pen drives to collect the translated output from translators. This was collected on one computer, where it was put together, missing pieces were discovered, and the whole project was eventually uploaded to Door43. One of the...

How to Build and Run a V-MAST Server (V-MAST Series 3)

Helpful Links: 7zip Balena Etcher  

Austrialia travel

If passing through Australia, (even for a short time) don't forget to get your Australian visa. Chuck US citizens, and many others, can get an electronic visa from  https://www.eta.immi.gov.au/ETAS3/etas. It costs $20AUS and takes only a few seconds (if you have all...

Gabon

e-Visa was not fast. iVisa.com provided no support, and we got our visas through the intervention of our local contact. Visa payment is made in cash at entry to the country. Officially, the visa is paid in Euros or FCFA — I was quoted $110 to pay in US currency. Two...

Converting Bible translation from MS Word/LibreOffice

The files referred to in this article can be downloaded here: Tarangan_James_Philemon While we normally encourage translation work to be done in one of our tools (Autographa, translationStudio, vMAST) sometimes it is already in process or is just better for the local...

How to format Translation Notes for publishing (V-MAST)

In V-MAST, translators are given free rein to reformat the content. However, they should not reformat content destined to be published as source text. This article specifies the expected content format. Note that BTT Writer does its own formatting when translating tN,...

Electrical Plugs

Indonesian Adapter

More Visa requirement tips

In Congo, Brazzaville, I was expected to have a transit visa, but did not. They allowed me to buy one there — $40, if I'm not mistaken. However, they then connected me with one of their "baggage handlers" who shook me down for more money. My advice is to get the visa...

VMAST Keywords Fix

What to do when you can't move beyond reconciling keywords.  How to delete hidden keywords in a v-mast server