Subtitle edit ocr not working
10/7/2023

Now this does take quite a bit of extra time, but it will usually clean this up. I have been working with SRT files for about two years now, and yes, the lower-case L versus capital I confusion is a pain and my biggest problem. I came here trying to find a better method, and read the post.

If you really are hell-bent on not using a dictionary, then you need a better algorithm, because what you have will not work. I'd be inclined to dump the output of gocr to ispell (or aspell, or ...). Your proposed algorithm is totally flawed. The former will not spell the word "ill" under your scheme, while the latter will be littered with lower-case l.

As I already indicated, the start of a sentence will break your initial-I rule, as will proper nouns.

You said that a word consisting only of upper-case I is fine, even if there is more than one. The obvious answer there is to always choose, for example, a capital I even if the first letter recognised is a lower-case L. Roman numerals are fine, but any capital I not at the beginning of a word needs changing.

But what needs changing, and what to look for, does depend on whether the OCR engine comes across a lower-case L or a capital I first.
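For what it's worth, the dictionary route can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not what Subtitle Edit, gocr, ispell or aspell actually do: the tiny `WORDS` set stands in for a real word list, and the helper names `candidates`, `plausible_case`, `fix_token` and `fix_line` are made up for this example. The idea is to try every l/I spelling of a word and keep the result only when exactly one spelling is a known word with sensible casing.

```python
import itertools
import re

# Stand-in for a real dictionary (assumption): lower-case entries only.
WORDS = {"i", "it", "ill", "will", "little", "cold", "be", "a"}

AMBIGUOUS = "lI"  # the two glyphs the OCR engine tends to confuse


def candidates(token):
    """Every spelling obtained by reading each l/I as either letter."""
    pools = [AMBIGUOUS if ch in AMBIGUOUS else ch for ch in token]
    return {"".join(combo) for combo in itertools.product(*pools)}


def plausible_case(word):
    """Accept all-lower, ALL-CAPS or Capitalised; reject mixed case like 'wiIl'."""
    return (word.islower() or word.isupper()
            or (word[0].isupper() and word[1:].islower()))


def fix_token(token, words=WORDS):
    """Return the single dictionary-backed spelling, else the token unchanged."""
    if not any(ch in AMBIGUOUS for ch in token):
        return token
    matches = [c for c in candidates(token)
               if plausible_case(c) and c.lower() in words]
    # Zero or several matches means we cannot decide, so leave the token alone.
    return matches[0] if len(matches) == 1 else token


def fix_line(line, words=WORDS):
    """Apply fix_token to every run of ASCII letters in a subtitle line."""
    return re.sub(r"[A-Za-z]+", lambda m: fix_token(m.group(0), words), line)


if __name__ == "__main__":
    print(fix_line("lt wiIl be a Iittle cold"))
```

Note the limits: a word such as "Al" versus "AI" stays untouched when the word list contains both readings, Roman numerals like "III" are only recognised if the word list includes them, and the number of candidates doubles with every l or I in a word, which is fine for ordinary words but not for long runs.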