Every day we offer FREE licensed software you’d have to buy otherwise.
PDF Text OCR Xtractor 3.2.2.20 was available as a giveaway on September 5, 2024!
PDF Text OCR Xtractor is perfect for extracting text from PDFs and all kinds of popular image formats, such as PNG, JPG, BMP, and TIFF.
PDF Text OCR Xtractor uses Tesseract OCR technology. Tesseract is perhaps the most powerful and advanced OCR software out there, and here is why. First of all, a bit of history: it was developed at HP between 1985 and 1994, and HP released it as open source under the Apache License in 2005. In 2006, Google began sponsoring developers to work on Tesseract. Fast forward to today, and Tesseract has become one of the most capable OCR engines, using deep learning (an LSTM-based recognizer since version 4) to extract text from images (BMP, PNG, JPEG, TIFF, etc.), including PDF pages once they are rendered as images.
PDF Text OCR Xtractor supports 20+ different languages and lets you apply custom preprocessing to source files and images before analyzing them, such as smoothing, DPI adjustment, and contrast enhancement.
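As a rough illustration of the kind of preprocessing mentioned above, here is a minimal Python sketch of two such steps: a linear contrast stretch on grayscale pixel values and a resize factor for low-DPI scans. It is not the product's actual implementation. The function names and the 300 DPI target are assumptions chosen for the example (300 DPI is a commonly recommended minimum for Tesseract input).

```python
def stretch_contrast(pixels, low=0, high=255):
    """Linearly rescale grayscale values so the darkest pixel maps to
    `low` and the brightest to `high` (hypothetical helper)."""
    lo, hi = min(pixels), max(pixels)
    if lo == hi:
        # Flat image: there is no contrast to stretch.
        return list(pixels)
    return [round(low + (p - lo) * (high - low) / (hi - lo)) for p in pixels]


def upscale_factor(current_dpi, target_dpi=300):
    """Resize factor needed to bring a scan up to `target_dpi`.
    Never downscales; the 300 DPI default is an assumption."""
    return max(1.0, target_dpi / current_dpi)


if __name__ == "__main__":
    # A washed-out scan line whose values only span 20..120.
    print(stretch_contrast([20, 40, 120]))
    # A 150 DPI scan would need to be doubled in size.
    print(upscale_factor(150))
```

In a real pipeline these steps would run on the full image with an imaging library before the result is handed to the OCR engine.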
PDF Text OCR Xtractor offers high accuracy and converts your images and PDFs into editable, searchable text. The conversion from image to text is quick.
Main Features:
1. Use of the best OCR technology available.
2. Support for 20+ different languages.
3. Useful image transformations to enhance accuracy on difficult documents.
Extra Features:
1. Cheapest Tesseract engine graphical user interface you can possibly find!
2. Support for PDF and all common image formats like PNG, JPG, BMP.
System Requirements: Windows 7/8.1/10/11 (x32/x64)
File Size: 103 MB
License: Lifetime
Price: $29.90
Installed on Win 11 just fine. I tested it on a few PDFs. Single-page text PDFs worked well, though some of the text formatting was funky. Forms and landscape docs didn't convert well, and multi-page docs seem to need to be converted page by page or maybe in reverse page order. But it's worth keeping for free.
Installed it, and there are "-1" values all over the software. What is this?
Only English. Where is the language pack?
Looks fine, but my language is not supported and the results with it are really poor. When I tried to switch to another language (more similar to mine than English), it didn't work. When I try to switch, I get a dialog window: "Language not installed. Do you want to install it?" I answer "Yes" and nothing happens. (I tried this with a few languages; only French could be selected, no other.)
And even in English the results are far from perfect. Maybe a keeper (I have no OCR program license now and I need it rarely, so a free solution is fine), but far from the professional solution (ABBYY) I have used in the past.
Henry, only the English language training data is installed by default, and since French uses the same character set, they just use the English training data for it. The program is supposed to download new training data from a web server somewhere, but I can't be more specific because it fails to even make a network connection when you click "Yes" in the "Language not installed" dialog. I have left comments on the PCWinsoft blog page listed in the Tech subsection of their website, which is under construction. The comments are still awaiting moderation. The URL of the blog entry is https://pcwinsoft.com/index.php/2018/10/26/3-stocks-to-buy-and-hold-through-the-panic-selling/ (the odd name is likely because they edited a pre-existing template blog post rather than creating a fresh entry, but it is the PDF OCR post).
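For anyone who wants to check this themselves: Tesseract keeps each language in a file named `<code>.traineddata` inside a `tessdata` folder, so the installed languages can be listed directly. Below is a minimal Python sketch. The helper names are made up for illustration, and where this particular program keeps its `tessdata` folder is not something I know, so the path is left for you to supply.

```python
from pathlib import Path


def installed_languages(tessdata_dir):
    """Return the sorted language codes that have a .traineddata file
    in `tessdata_dir` (hypothetical helper; the folder is user-supplied)."""
    return sorted(p.stem for p in Path(tessdata_dir).glob("*.traineddata"))


def missing_languages(wanted, tessdata_dir):
    """Return the codes from `wanted` that have no training data installed."""
    have = set(installed_languages(tessdata_dir))
    return [code for code in wanted if code not in have]


if __name__ == "__main__":
    import sys

    # Usage: python check_langs.py <path-to-tessdata> eng fra deu
    folder, codes = sys.argv[1], sys.argv[2:]
    print("installed:", installed_languages(folder))
    print("missing:", missing_languages(codes, folder))
```

If only `eng` shows up as installed, that would match the behaviour described above.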
TK, thanks for the more detailed explanation. OK, but then it should say "it supports 2+ languages", not "20+"...
Only English is supported.