Category Archives: PDF Translation


Tips for editing scanned PDFs – Part 2

In part 1 we covered cleaning-up a single-page, scanned PDF to make it ready for OCR. This produced a better quality scan which was then more likely to give better OCR results.

This tutorial shows a similar process but for a multi-page PDF – one that’s just too long to make editing each page by hand a viable option.

Although the example PDF used isn’t actually a scanned document (we couldn’t find one long enough!) the same steps can be applied to a scan.

Watch the short movie which makes use of the ‘Copy Across Pages’ feature of Infix.

old paper

Tips for editing scanned PDFs – Part 1

Dealing with scanned PDFs can be tricky since the quality of both the source material and the scan can vary enormously. Scans have to be processed using OCR so they need to be good quality to get useful results. If not, you’ll tend to get gibberish text which can render the whole OCR process a waste of time.

Two common problems are a lack of contrast in the scan and unwanted artifacts such as scribbles on the paper.

Fortunately Infix can help with these problems.
Watch the short movie to see how.

Part 2 shows how to achieve similar results in long PDFs where editing each page isn’t an option.


Wrapping text around awkward shapes

Sometimes you need to make your paragraphs flow around a picture or diagram, hugging the boundary but not overlapping it. This is difficult to do using just paragraphs and adjusting right and left margins.

Infix has a trick up it’s sleeve which can get you out trouble in this respect. Hidden deep in the text editing menu is the Line Width sub-menu and it’s made for just this kind of situation.

Take a look at the movie which explains how it works, much better than a short blog-post could ever do.

Watch the tutorial.

Assorted ice cream cones including chocolate, vanilla and strawberry

The 3 types of scanned PDFs

Did you know there are actually 3 different types of scanned PDF which can, if you’re not careful, complicate the task of translation:

  • The simple scan – every page is just an image.
  • Searchable scans – each image has hidden text behind it.
  • Mixed – can include scanned images, hidden and real text all in the same PDF.

TransPDF will automatically run OCR on a PDF if it detects no real text – in other words, type 1 from the list above. But for types 2 and 3 it will sense the presence of real text and skip the OCR phase. This can be a problem when you need to translate all the text in the PDF.

Infix to the rescue

Continue reading

Beautiful Latina Woman at table in Kitchen with Coupons

Ignore page headers and footers in PDF

When you want to get the text out of a PDF for translation or any other reason, headers and footers can cause problems. Often repeated across every page, they break up your text-flow and are time consuming to remove.

Fortunately it’s pretty simple to tell Infix PDF Editor to ignore header and footer regions before you export the PDF.

Use the Crop tool to drag-out a box which includes all the text you want from a typical page, but excludes the header and footer areas. Then press the Return key to finish. The next time you export the PDF, all text outside of your crop-box will be ignored.

It’s easier to see it in action, so we’ve prepared a short movie showing how it’s done. You can also read all about the Crop tool in the on-line user-manual.


MemoQ 8.1 adds TransPDF integration

Great news for MemoQ users – TransPDF is now available from right within your favorite translation tool.

Along with a host of other new features, the MemoQ update includes direct integration with TransPDF meaning that you can now do all your PDF preparation, previews and generation without ever having to leave MemoQ.

You can read all about version 8.1 at the official product page. I also recommend you take a look at their excellent step-by-step guide to handling PDF jobs with the new software and TransPDF.

And remember, as always, you can edit your translated PDFs for free using Infix PDF Editor.

jealous markup

Converting a 500-page PDF user manual with TransPDF

Translation expert Gábor Ugray has posted a fascinating series of articles on his blog documenting his efforts to translate an apparently simple iOS app.

We particularly like part 4, in which he uses TransPDF to mine information from existing PDFs to improve the translation process – a use for TransPDF we’d not even considered before.

Along the way, he’s very complimentary about TransPDF –

The really cool thing about TransPDF is that it’s able recreate a fully formatted PDF from the translated XLIFF that you upload… And when I say this is a cool thing, I really mean cool, as in way out there, extraterrestrially cool.

Check out his article which includes links to the entire series.


Worried about security when translating PDFs?

We know some of our users love the idea of our new PDF translation service – TransPDF, but can’t take the risk of uploading confidential PDFs to a public server.

Others have such high volumes of PDFs to translate, they would need an entire server all to themselves!

We think we now have the answer.
Continue reading


Translate scanned PDFs with TransPDF

We all know TransPDF makes the job of translating PDFs faster and easier than it’s ever been. If only it could handle our scanned PDFs, it’d be so much better…

Well now it can!

We’ve added automatic conversion (OCR) of scanned PDFs to make them fully editable and translatable. Simply upload your PDF to TransPDF and you’ll get beautifully clean XLIFF in return.

There’s no additional charge for the service but we do deduct your final-PDF fee at the start of the process rather than at the end like normal, even for users with a valid Infix license.

Try it now, it’s fab!



Memsource integrates with

We’re pleased to report that on-line translation specialists Memsource have launched an update to their platform which includes direct integration with TransPDF.

Now Memsource 6.0 users can go from PDF->XLIFF->PDF using the combined power of Memsource and Users will need to register for a free account then enter their new account details into the Memsource platform.

Since all new accounts get 50 free pages, some will find that’s plenty for their first PDF translations.

Read the company’s announcement for further details.

Read our own step-by-step guide to using Memsource with TransPDF.