المراجعة الحالية بتاريخ 02:02، 26 يناير 2017

محتويات

1 Optical Character Recognition
2 Current Status of Open Source Arabic OCR software
3 Resources
- 3.1 Arabic OCR Links
  - 3.1.1 Papers
  - 3.1.2 Software (FOSS)
- 3.2 Other Links

Optical Character Recognition

OCR is the ability to scan a document (or grab a PDF file) and run an OCR program on it and it will generate, based on optical recognition and approximation, an editable text file. For an idea about OCR see http://www.students.cs.uu.nl/people/mjkammer/Work/intro_2_OCR.html

Current Status of Open Source Arabic OCR software

The only FOSS OCR system with Arabic support is Tesseract, help is needed in testing and training it.

Resources

OCR from Wikipedia

Arabic OCR Links

Papers

Software (FOSS)

Tesseract is an open source OCR, initially developed by HP, and released under the Apache License. 3.x versions has Arabic support.
GOCR - included in Debian and other distributions. No Arabic support.
GNU Ocrad "is an OCR [...] program based on a feature extraction method". No Arabic support.

@@ سطر 1: / سطر 1: @@
-<div class=english>
 =Optical Character Recognition=
 OCR is the ability to scan a document (or grab a PDF file) and run an OCR program on it and it will generate, based on optical recognition and approximation, an editable text file.
@@ سطر 5: / سطر 4: @@
 = Current Status of Open Source Arabic OCR software =
+The only FOSS OCR system with Arabic support is Tesseract, help is needed in testing and training it.
-Actually (2007-08-27) The principal GPL OCR active utilities (OCRAD, GOCR, OCRE) doesn't support arabic printed text recognition. [http://directory.fsf.org/claraocr.html ClaraOCR] project seems to be inactive, but its documentation promises to work well with any horitzontal-writing language.
-==Siragi-OCR==
-[http://siragi.sourceforge.net/ SIRAGI] is an open source software designed to help blind and partially sighted people working with their computer. Visually impaired people can use this program to "listen" the content of their screen under windows or Linux/KDE. The main advantage of using SIRAGI is the support of arabic language for braille language and for speech synthesis in arabic.
-As a part of This, Siragi's developer started devoloping a FOSS OCR that should support Arabic.
-[http://www.arabeyes.org/project.php?proj=Siragi Siragi's Arabeyes Page]
 = Resources =
-* [http://www.linux-ocr.ekitap.gen.tr/ List of Linux OCR applications]
 * [http://en.wikipedia.org/wiki/Optical_character_recognition OCR from Wikipedia]

«Arabic OCR»: الفرق بين المراجعتين

المراجعة الحالية بتاريخ 02:02، 26 يناير 2017

محتويات

Optical Character Recognition

Current Status of Open Source Arabic OCR software

Resources

Arabic OCR Links

Papers

Software (FOSS)

Other Links

قائمة التصفح

أدوات شخصية

المتغيرات

نطاقات

بحث

مزيد

معاينة

تصفح

روابط سريعة

ترجمة

تطوير

فن

حول عربآيز

مشاريع خارجية

أدوات