Hi all,
I'm in the process of trying to scan a Korean book into an eBook.
Being Korean-American, I can read/write Korean just fine, but English is my native language so errors in the OCR don't pop out at me like errors in English text would, so I'm having to go through it line by line to make sure things have been recognized properly.
Two problems I've been coming across in the process is that the spacing of the letters in the book are a bit narrower than normal (so the OCR program will recognize "ABC DEF GHI JKL" as "ABCDEFGHIJKL") and that small dashes/ticks on characters are sometimes missed, so it will mix up vowels like: "ㅣ ㅏ ㅑ" where just one little tick makes a difference.
I realize this is probably a margin of error that I am going to have to deal with when it comes to OCRing documents, but I was wondering if there was an editor out there that will let me go through it line by line, putting a text box of what has been OCRed under the original image, so that I can just compare it line by line rather than paragraphs at at time, like this:
And when I press enter, it would pop up the same box on the next line, and so on.
I'm using Adobe Acrobat 10 and ReadIris 12, both for mac to do the OCR. Any suggestions would be greatly appreciated! Thanks!
I'm in the process of trying to scan a Korean book into an eBook.
Being Korean-American, I can read/write Korean just fine, but English is my native language so errors in the OCR don't pop out at me like errors in English text would, so I'm having to go through it line by line to make sure things have been recognized properly.
Two problems I've been coming across in the process is that the spacing of the letters in the book are a bit narrower than normal (so the OCR program will recognize "ABC DEF GHI JKL" as "ABCDEFGHIJKL") and that small dashes/ticks on characters are sometimes missed, so it will mix up vowels like: "ㅣ ㅏ ㅑ" where just one little tick makes a difference.
I realize this is probably a margin of error that I am going to have to deal with when it comes to OCRing documents, but I was wondering if there was an editor out there that will let me go through it line by line, putting a text box of what has been OCRed under the original image, so that I can just compare it line by line rather than paragraphs at at time, like this:

And when I press enter, it would pop up the same box on the next line, and so on.
I'm using Adobe Acrobat 10 and ReadIris 12, both for mac to do the OCR. Any suggestions would be greatly appreciated! Thanks!