From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from smtp.corp.redhat.com (int-mx07.intmail.prod.int.phx2.redhat.com [10.5.11.22]) by lists01.pubmisc.prod.ext.phx2.redhat.com (8.13.8/8.13.8) with ESMTP id wBHF3SmV010897 for ; Mon, 17 Dec 2018 10:03:28 -0500 Received: by smtp.corp.redhat.com (Postfix) id BB2971059581; Mon, 17 Dec 2018 15:03:28 +0000 (UTC) Received: from mx1.redhat.com (ext-mx03.extmail.prod.ext.phx2.redhat.com [10.5.110.27]) by smtp.corp.redhat.com (Postfix) with ESMTPS id AFF411059593 for ; Mon, 17 Dec 2018 15:03:26 +0000 (UTC) Received: from mail-ot1-f49.google.com (mail-ot1-f49.google.com [209.85.210.49]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 7EAAE81138 for ; Mon, 17 Dec 2018 15:03:25 +0000 (UTC) Received: by mail-ot1-f49.google.com with SMTP id w25so12369483otm.13 for ; Mon, 17 Dec 2018 07:03:25 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to; bh=fsHrFtQpYB9pI7mvePDasmbwTTVIOEIlNBbHg4J4ef0=; b=gt0DYlvnobp38KaKhAhI0lQ+drrhMMDblgfbA0bec0E6RKU3CbzoCDPxUSpyXQ46Vx +I4RBLfO1oGSX8wYt9gga/WSArfgGhiwJzs5i6JcFaEf2rwPDECAZ6N/fcPD8/73knoF ORP4pKunC9z72edeLzHv2QLwTw7lKSBUmm6MGDExjG9LU//2ZOUC2skHVoxPCQ0ehRjc JNrP5R3x4WpMPipUk5ko+6rpQQuVipH6x7qlpp3bVS7ebmDCUgKbBInAr22eIeicdXyQ ejIoxGMQB7drqu8mZ7wNYdm5hN7nii8o89cToGBuiI7J2pk/NICZYGZmPdYJaAAfzcRR txXA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to; bh=fsHrFtQpYB9pI7mvePDasmbwTTVIOEIlNBbHg4J4ef0=; b=JdHMEps/lUafY7fEsC/+wd2+PY4KzwqT7IOMdXmHA0CtU/tuVeqxaUSGKVFRTAzZ4M qDxCyFuO85wsYJDRiQuLjhAFwyXrW67+nrVzyjG3+DQfJblu1uzKI8VVYaDPH/9eb098 sL57ZtagAzktcwswSMxfJzqWzDd9+crlTyyyA8EtMcqoQCB3C+vM4WnTqRQVegKvNmdd HOTp81tq0xuCz9/n03lAyai1n7J5i+L3Mg+cpoDtuqRT57sxIRPkELiYrWgl22S4CUCv CIyM/oCBxUhNSLQLs7FRJGlG5hg6WpcNtXmOMOdxkS88KJ72hXFEeMfalNKPjRX0Z0ru fXXw== X-Gm-Message-State: AA+aEWbV8b8fhinbsw0w82ck4vteSjcDUzi5k9KK2/Ye/jtEopb4Gebs uLmAy66b0pvcvVUzmcK2gcdQgGFLVrnHRGcpkO04ZQ== X-Google-Smtp-Source: AFSGD/WoowPzaZhQWrm78nyn4g8IgkZdWY6dtgHqgmPdUWhwtwwp4EfmKXUAdiytVlsspiiKAOeYtON978GCtfnp4eg= X-Received: by 2002:a05:6830:120c:: with SMTP id r12mr10355704otp.252.1545059004473; Mon, 17 Dec 2018 07:03:24 -0800 (PST) MIME-Version: 1.0 Received: by 2002:a05:6830:2158:0:0:0:0 with HTTP; Mon, 17 Dec 2018 07:03:23 -0800 (PST) In-Reply-To: References: Date: Mon, 17 Dec 2018 15:03:23 +0000 Message-ID: Subject: Re: extracting text from png files To: blinux-list@redhat.com Content-Type: text/plain; charset="UTF-8" X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.27]); Mon, 17 Dec 2018 15:03:25 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.27]); Mon, 17 Dec 2018 15:03:25 +0000 (UTC) for IP:'209.85.210.49' DOMAIN:'mail-ot1-f49.google.com' HELO:'mail-ot1-f49.google.com' FROM:'mewtamer@gmail.com' RCPT:'' X-RedHat-Spam-Score: -0.11 (DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, FREEMAIL_FROM, RCVD_IN_DNSWL_NONE, SPF_PASS) 209.85.210.49 mail-ot1-f49.google.com 209.85.210.49 mail-ot1-f49.google.com X-Scanned-By: MIMEDefang 2.78 on 10.5.110.27 X-Scanned-By: MIMEDefang 2.84 on 10.5.11.22 X-loop: blinux-list@redhat.com From: Linux for blind general discussion X-BeenThere: blinux-list@redhat.com X-Mailman-Version: 2.1.12 Precedence: junk Reply-To: blinux-list@redhat.com List-Id: Linux for blind general discussion List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 17 Dec 2018 15:03:28 -0000 What you're looking for is Ocular Character Recognition or OCR for short. I've never managed to figure out its command line syntax, but I believe tesseract is considered the current standard option for Linux. There's also Cuneiform, which I have actually used with some success in the past, but I believe its either contrib or non-free under Debian, so you might need to enable extra repositories depending on how strict your distro is about sticking to FOSS principles. I will warn you, in my experience, OCR is as likely to produce gibberish as legible text. A scan of a page of prose type set in a standard font will probably OCR well, but the more mixed text is with graphics, the fancier the font, and the more complicated the page layout, the more likely errors are. I've tried OCR'ing scanlated manga(Japanese comics) in the past and have gotten results that included unpredictible patterns of letters and numbers misidentified as others(S and 5, P and D, I and 1, LI and U, B and g where just some of the common substitutions I encountered trying to fix the OCR'd text), characters my screenreader could'nt identify or identified as characters I'm unfamiliar, and even when the text was clear, paragraphs out of order wasn't uncommon. -- Sincerely, Jeffery Wright Bachelor of Computer Science President Emeritus, Nu Nu Chapter, Phi Theta Kappa.