
Subject (Author)
* OCR on Windows (Bill Powell)
+* Re: OCR on Windows (micky)
|+- Re: OCR on Windows (Bill Powell)
|`* Re: OCR on Windows (Peter Flynn)
| `- Re: OCR on Windows (Paul)
+* Re: OCR on Windows (Newyana2)
|+* Re: OCR on Windows (micky)
||`* Re: OCR on Windows (Jeff Barnett)
|| `* Re: OCR on Windows (Newyana2)
||  `* Re: OCR on Windows (micky)
||   `- Re: OCR on Windows (Wolf Greenblatt)
|`* Re: OCR on Windows (Paul in Houston TX)
| `* Re: OCR on Windows (Nick Cine)
|  `- Re: OCR on Windows (Bill Powell)
+* Re: OCR on Windows (cable shill)
|`* Re: OCR on Windows (Stan Brown)
| `* Re: OCR on Windows (Jan K.)
|  `- Re: OCR on Windows (Big Al)
+* Re: OCR on Windows (Stan Brown)
|+* Re: OCR on Windows (Newyana2)
||`- Re: OCR on Windows (david)
|`* Re: OCR on Windows (Enrico Papaloma)
| `* Re: OCR on Windows (Joerg Walther)
|  `- Re: OCR on Windows (Stan Brown)
+* Re: OCR on Windows (Herbert Kleebauer)
|+* Re: OCR on Windows (knuttle)
||+- Re: OCR on Windows (Isaac Montara)
||+- Re: OCR on Windows (Stan Brown)
||`* Re: Irfanview on Windows (wasbit)
|| `* Re: Irfanview on Windows (Steve Hayes)
||  `- Re: Irfanview on Windows (Andrew)
|`* Re: OCR on Windows (Stan Brown)
| +* Re: OCR on Windows (Jørgen Nielsen)
| |`- Re: OCR on Windows (Stan Brown)
| `* Re: OCR on Windows (Herbert Kleebauer)
|  +* Re: OCR on Windows (Paul)
|  |`- Re: OCR on Windows (Herbert Kleebauer)
|  +* Re: OCR on Windows (Stan Brown)
|  |`- Re: OCR on Windows (Paul)
|  `- Re: OCR on Windows (wasbit)
+- Re: OCR on Windows (Jim the Geordie)
`- Re: OCR on Windows (Mr. Man-wai Chang)

Subject: Re: OCR on Windows
From: Stan Brown
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: Oak Road Systems
Date: Mon, 15 Jul 2024 20:09 UTC
References: 1 2

On Sun, 14 Jul 2024 09:25:09 +0200, Herbert Kleebauer wrote:
> On 14.07.2024 02:46, Bill Powell wrote:
>
> > I have a series of one-page PDFs that are really images and not text even
> > though they look like they're just a page of simple text in the same font.
> >
> > Is there a way to easily OCR a PDF to actual text on Windows for free?
>
> For only a few lines of text you can use the Snipping Tool: press
> <WIN><SHIFT>S and select the part of the screen with the text.
> When the Snipping Tool opens, select the OCR function.

What OCR function? I just get a menu at the top of the screen
consisting of five icons: Rectangular snip, Freeform snip, Window
snip, Fullscreen snip, Close snipping.

--
Stan Brown, Tehachapi, California, USA https://BrownMath.com/
Shikata ga nai...

Subject: Re: OCR on Windows
From: Stan Brown
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: Oak Road Systems
Date: Mon, 15 Jul 2024 20:11 UTC
References: 1 2 3

On Sun, 14 Jul 2024 06:54:16 -0400, knuttle wrote:
> I use Irfanview for all my image and OCR projects.
>
> You need Irfanview and the OCR plugin.
>
> Open the PDF file in Irfanview, highlight the text and activate the
> OCR function.

I've been using Irfanview for years, but when I tried the OCR plugin
I found it did a significantly worse job than OneNote.

--
Stan Brown, Tehachapi, California, USA https://BrownMath.com/
Shikata ga nai...

Subject: Re: OCR on Windows
From: Stan Brown
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: Oak Road Systems
Date: Mon, 15 Jul 2024 20:19 UTC
References: 1 2 3 4

On Mon, 15 Jul 2024 10:10:05 +0200, Joerg Walther wrote:
>
> Enrico Papaloma wrote:
>
> >Download PDF-XChange Editor/Plus (32/64 Bit Version) (as ZIP File)
> >Download PDF-XChange Editor PORTABLE (32/64 Bit Version) (as ZIP File)
> >Download PDF-XChange Editor PORTABLE ohne OCR (32/64 Bit Version) (as ZIP File)
> >
> >It says "ohne OCR". What does "ohne" mean anyway?
>
> Ohne is German, meaning "without".

As in /Die Frau Ohne Schatten/ (The Woman without a Shadow), an
unjustly neglected opera by Richard Strauss.

I recognize several of the singers' names in this video, so it ought
to be a good performance, but I haven't listened to it because I have
one on CD:

https://www.youtube.com/watch?v=rFfc_rP9ROk

--
Stan Brown, Tehachapi, California, USA https://BrownMath.com/
Shikata ga nai...

Subject: Re: OCR on Windows
From: Jørgen Nielsen
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: A noiseless patient Spider
Date: Mon, 15 Jul 2024 20:49 UTC
References: 1 2 3

Monday, 15-07-2024, Stan Brown wrote:
> On Sun, 14 Jul 2024 09:25:09 +0200, Herbert Kleebauer wrote:
>> On 14.07.2024 02:46, Bill Powell wrote:
>>
>>> I have a series of one-page PDFs that are really images and not text even
>>> though they look like they're just a page of simple text in the same font.
>>>
>>> Is there a way to easily OCR a PDF to actual text on Windows for free?
>>
>> For only a few lines of text you can use the Snipping Tool: press
>> <WIN><SHIFT>S and select the part of the screen with the text.
>> When the Snipping Tool opens, select the OCR function.
>
>
> What OCR function? I just get a menu at the top of the screen
> consisting of five icons: Rectangular snip, Freeform snip, Window
> snip, Fullscreen snip, Close snipping.
>
Select Rectangular snip, select the text, double click on Snipping
Tools, click on text in the menu, select the text and copy.

--
Regards, Jørgen
[e-mail address is valid]

Subject: Re: OCR on Windows
From: Herbert Kleebauer
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: A noiseless patient Spider
Date: Mon, 15 Jul 2024 21:01 UTC
References: 1 2 3

On 15.07.2024 22:09, Stan Brown wrote:

>> For only a few lines of text you can use the Snipping Tool: press
>> <WIN><SHIFT>S and select the part of the screen with the text.
>> When the Snipping Tool opens, select the OCR function.
>
>
> What OCR function? I just get a menu at the top of the screen
> consisting of five icons: Rectangular snip, Freeform snip, Window
> snip, Fullscreen snip, Close snipping.

Maybe it is only available in Win11 but not in Win10.
I have version: Snipping Tool 11.2405.32.0

https://support.microsoft.com/en-us/windows/use-snipping-tool-to-capture-screenshots-00246869-1843-655f-f220-97299b865f6b#ID0EDD=Windows_11

|| Once you've captured a snip, select the Text Actions button to
|| activate the Optical Character Recognition (OCR) feature. This
|| allows you to extract text directly from your image. From here,
|| you have the option to either select and copy specific text, or
|| use the tools to Copy all text or to Quick redact. All text
|| recognition processes are performed locally on your

Subject: Re: OCR on Windows
From: Paul
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: A noiseless patient Spider
Date: Tue, 16 Jul 2024 05:18 UTC
References: 1 2 3 4

On 7/15/2024 5:01 PM, Herbert Kleebauer wrote:
> On 15.07.2024 22:09, Stan Brown wrote:
>
>>> For only a few lines of text you can use the Snipping Tool: press
>>> <WIN><SHIFT>S and select the part of the screen with the text.
>>> When the Snipping Tool opens, select the OCR function.
>>
>>
>> What OCR function? I just get a menu at the top of the screen
>> consisting of five icons: Rectangular snip, Freeform snip, Window
>> snip, Fullscreen snip, Close snipping.
>
>
> Maybe it is only available in Win11 but not in Win10.
> I have version: Snipping Tool 11.2405.32.0
>
> https://support.microsoft.com/en-us/windows/use-snipping-tool-to-capture-screenshots-00246869-1843-655f-f220-97299b865f6b#ID0EDD=Windows_11
>
> || Once you've captured a snip, select the Text Actions button to
> || activate the Optical Character Recognition (OCR) feature. This
> || allows you to extract text directly from your image. From here,
> || you have the option to either select and copy specific text, or
> || use the tools to Copy all text or to Quick redact. All text
> || recognition processes are performed locally on your

This is what I'm seeing.

[Picture]

https://i.postimg.cc/BnZCqsSV/snippingtool-OCR-is-implicit.gif

You select "text actions" first.

The OCR conversion happens upon entry to the function,
with no request on your part.

The "Copy as Text" is presumably supposed to trigger "OCR was done"
in your brain??? A violation of discoverability. Or of some other
principle they might have taught in CS school.

Paul

Subject: Re: OCR on Windows
From: Herbert Kleebauer
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: A noiseless patient Spider
Date: Tue, 16 Jul 2024 06:43 UTC
References: 1 2 3 4 5

On 16.07.2024 07:18, Paul wrote:

> The "Copy as Text" is presumably supposed to trigger "OCR was done"
> in your brain??? A violation of discoverability. Or of some other
> principle they might have taught in CS school.

I think it is a good idea to replace the keyboard sequence CTRL-A CTRL-C
by a simple mouse click. And there is also the button to remove email
addresses and phone numbers from t
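
The button for removing email addresses and phone numbers matches the Quick redact option in the Microsoft text quoted earlier in the thread. As a rough illustration only, and not a description of how Snipping Tool does it, masking those two kinds of data in already-extracted text could be sketched in Python with regular expressions. The patterns, the `quick_redact` name and the mask string are all assumptions made for this example. They will miss some formats and can over-match other digit runs such as dates.

```python
import re

# Hypothetical, deliberately simple patterns; real-world formats vary widely.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# A digit (optionally preceded by '+'), then at least six digits, spaces or
# common separators, ending on a digit.
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{6,}\d")


def quick_redact(text: str, mask: str = "[redacted]") -> str:
    """Replace anything that looks like an email address or phone number."""
    text = EMAIL_RE.sub(mask, text)
    return PHONE_RE.sub(mask, text)


if __name__ == "__main__":
    sample = "Mail klee@example.org or call +1 555 123 4567 today"
    print(quick_redact(sample))
```

Emails are masked first so that digits inside an address are not picked up separately by the phone pattern.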

Subject: Re: OCR on Windows
From: Stan Brown
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: Oak Road Systems
Date: Thu, 18 Jul 2024 22:06 UTC
References: 1 2 3 4

On Mon, 15 Jul 2024 22:49:46 +0200, Jørgen Nielsen wrote:
>
> Monday, 15-07-2024, Stan Brown wrote:
> > On Sun, 14 Jul 2024 09:25:09 +0200, Herbert Kleebauer wrote:
> >> On 14.07.2024 02:46, Bill Powell wrote:
> >>
> >>> I have a series of one-page PDFs that are really images and not text even
> >>> though they look like they're just a page of simple text in the same font.
> >>>
> >>> Is there a way to easily OCR a PDF to actual text on Windows for free?
> >>
> >> For only a few lines of text you can use the Snipping Tool: press
> >> <WIN><SHIFT>S and select the part of the screen with the text.
> >> When the Snipping Tool opens, select the OCR function.
> >
> >
> > What OCR function? I just get a menu at the top of the screen
> > consisting of five icons: Rectangular snip, Freeform snip, Window
> > snip, Fullscreen snip, Close snipping.
> >
> Select Rectangular snip, select the text, double click on Snipping
> Tools, click on text in the menu, select the text and copy.

As soon as I begin selecting text, the Snipping Tool icon menu at
the top of the screen disappears, so there's nothing to double-click
on.

--
Stan Brown, Tehachapi, California, USA https://BrownMath.com/
Shikata ga nai...

Subject: Re: OCR on Windows
From: Stan Brown
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: Oak Road Systems
Date: Thu, 18 Jul 2024 22:10 UTC
References: 1 2 3 4

On Mon, 15 Jul 2024 23:01:24 +0200, Herbert Kleebauer wrote:
> On 15.07.2024 22:09, Stan Brown wrote:
>
> >> For only a few lines of text you can use the Snipping Tool: press
> >> <WIN><SHIFT>S and select the part of the screen with the text.
> >> When the Snipping Tool opens, select the OCR function.

I did not write the above paragraph.

> > What OCR function? I just get a menu at the top of the screen
> > consisting of five icons: Rectangular snip, Freeform snip, Window
> > snip, Fullscreen snip, Close snipping.
>
>
> Maybe it is only available in Win11 but not in Win10.
> I have version: Snipping Tool 11.2405.32.0

Oh, silly me. We're in a Windows 10 newsgroup, so I thought we were
talking about a Windows 10 feature.

--
Stan Brown, Tehachapi, California, USA https://BrownMath.com/
Shikata ga nai...

Subject: Re: Irfanview on Windows
From: wasbit
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: A noiseless patient Spider
Date: Fri, 19 Jul 2024 09:05 UTC
References: 1 2 3

On 14/07/2024 11:54, knuttle wrote:
> On 07/14/2024 3:25 AM, Herbert Kleebauer wrote:
>> On 14.07.2024 02:46, Bill Powell wrote:
>>
>>> I have a series of one-page PDFs that are really images and not text
>>> even
>>> though they look like they're just a page of simple text in the same
>>> font.
>>>
>>> Is there a way to easily OCR a PDF to actual text on Windows for free?
>>
>> For only a few lines of text you can use the Snipping Tool: press
>> <WIN><SHIFT>S and select the part of the screen with the text.
>> When the Snipping Tool opens, select the OCR function.
>>
>> Or you can use Firefox to display the pdf and use an OCR
>> plug-in.
>>
> I use Irfanview for all my image and OCR projects.
>
> You need Irfanview and the OCR plugin.
>
> Open the PDF file in Irfanview, highlight the text and activate the
> OCR function.

I recently had to sort out an XP machine with some 500 wrongly named &
corrupted files that contained photos.
I was pleasantly surprised at the number of different types of file that
Irfanview would open, play & sort out the correct extension. Saved me
hundreds of clicks & hours of work.
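
One common way for a tool to sort out the correct extension is to look at the first few bytes of each file instead of trusting its name. Whether Irfanview works exactly like this is an assumption here. The sketch below, in Python, only covers a handful of well-known image signatures, and the function names and the alias table are made up for the example. It suggests renames without performing them.

```python
from pathlib import Path

# Leading "magic" bytes for a few common image formats (not exhaustive).
SIGNATURES = [
    (b"\xff\xd8\xff", ".jpg"),
    (b"\x89PNG\r\n\x1a\n", ".png"),
    (b"GIF87a", ".gif"),
    (b"GIF89a", ".gif"),
    (b"BM", ".bmp"),
    (b"II*\x00", ".tif"),
    (b"MM\x00*", ".tif"),
]

# Alternative spellings that should count as already correct.
ALIASES = {".jpeg": ".jpg", ".tiff": ".tif"}


def sniff_extension(header: bytes):
    """Return the extension implied by the leading bytes, or None if unknown."""
    for magic, ext in SIGNATURES:
        if header.startswith(magic):
            return ext
    return None


def suggest_renames(folder):
    """Yield (old_path, new_path) for files whose name disagrees with their content."""
    for path in sorted(Path(folder).iterdir()):
        if not path.is_file():
            continue
        with path.open("rb") as fh:
            ext = sniff_extension(fh.read(16))
        current = path.suffix.lower()
        current = ALIASES.get(current, current)
        if ext and current != ext:
            yield path, path.with_suffix(ext)
```

Looping over `suggest_renames(folder)` and calling `old.rename(new)` on each pair would apply the fixes. Corrupted files whose header is damaged return `None` from `sniff_extension` and are left alone.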

--
Regards
wasbit

Subject: Re: OCR on Windows
From: wasbit
Newsgroups: alt.comp.os.windows-10, comp.text.pdf
Organization: A noiseless patient Spider
Date: Fri, 19 Jul 2024 09:13 UTC
References: 1 2 3 4

On 15/07/2024 22:01, Herbert Kleebauer wrote:
> On 15.07.2024 22:09, Stan Brown wrote:
>
> >> For only a few lines of text you can use the Snipping Tool: press
> >> <WIN><SHIFT>S and select the part of the screen with the text.
> >> When the Snipping Tool opens, select the OCR function.
> >
> >
> > What OCR function? I just get a menu at the top of the screen
> > consisting of five icons: Rectangular snip, Freeform snip, Window
> > snip, Fullscreen snip, Close snipping.
>
>
> Maybe it is only available in Win11 but not in Win10.
> I have version: Snipping Tool 11.2405.32.0
>
> https://support.microsoft.com/en-us/windows/use-snipping-tool-to-capture-screenshots-00246869-1843-655f-f220-97299b865f6b#ID0EDD=Windows_11
>
>

FYI
The snipping tool is available in Windows 8.1.
A better name would be Screenshot tool. I use it on a regular basis.

--
Regards
wasbit

Subject: Re: Irfanview on Windows
From: Steve Hayes
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: Khanya Publications
Date: Fri, 19 Jul 2024 09:35 UTC
References: 1 2 3 4

On Fri, 19 Jul 2024 10:05:59 +0100, wasbit <wasbit@nowhere.com> wrote:

>I recently had to sort out an XP machine with some 500 wrongly named &
>corrupted files that contained photos.
>I was pleasantly surprised at the number of different types of file that
>Irfanview would open, play & sort out the correct extension. Saved me
>hundreds of clicks & hours of work.

I find Irfanview very useful for all kinds of graphics tasks.

--
Steve Hayes from Tshwane, South Africa
Web: http://www.khanya.org.za/stevesig.htm
Blog: http://khanya.wordpress.com
E-mail - see web page, or parse: shayes at dunelm full stop org full stop uk

Subject: Re: OCR on Windows
From: Paul
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: A noiseless patient Spider
Date: Fri, 19 Jul 2024 15:17 UTC
References: 1 2 3 4 5

On 7/18/2024 6:10 PM, Stan Brown wrote:
> On Mon, 15 Jul 2024 23:01:24 +0200, Herbert Kleebauer wrote:
>> On 15.07.2024 22:09, Stan Brown wrote:
>>
>> >> For only a few lines of text you can use the Snipping Tool: press
>> >> <WIN><SHIFT>S and select the part of the screen with the text.
>> >> When the Snipping Tool opens, select the OCR function.
>
> I did not write the above paragraph.
>
>> > What OCR function? I just get a menu at the top of the screen
>> > consisting of five icons: Rectangular snip, Freeform snip, Window
>> > snip, Fullscreen snip, Close snipping.
>>
>>
>> Maybe it is only available in Win11 but not in Win10.
>> I have version: Snipping Tool 11.2405.32.0
>
> Oh, silly me. We're in a Windows 10 newsgroup, so I thought we were
> talking about a Windows 10 feature.
>

Windows 10 has two programs.

SnippingTool.exe is a win32 program, with a WinAmp-tiny interface and no features.
You would not expect to find any functions "sandwiched" into that.

But they also have the "Snip and Sketch" Metro.App, with decorations suspiciously
similar to the Windows 11 "SnippingTool" Metro.App. Snip and Sketch is likely
the fast prototype version of the SnippingTool that ships on Windows 11.

Apparently, for a short time, a Text Actions button was exposed on Win10 "Snip and Sketch",
but only for A/B testing (only a percentage of users would see it, and perhaps
with no warning either), and presumably it was completely removed again afterwards.

Search engines are pretty useless for tracking stuff like this. Using
relatively neutral keywords, as an example, I got one "result" on one page,
for one of my queries, almost like the topic was "verboten".

*******

One thing of minor interest is that OCR is available to .NET programs.

https://learn.microsoft.com/en-us/samples/microsoft/windows-universal-samples/ocr/

Without some sort of development history ("where did it come from"),
I doubt a lot of developers would invest time quantifying it
for suitability in a product. All the OCR things I've ever tested,
have sucked, so my going-in assumption when a new one shows up,
is it will be more of the same.

Paul

Subject: Re: Irfanview on Windows
From: Andrew
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: BWH Usenet Archive (https://usenet.blueworldhosting.com)
Date: Fri, 19 Jul 2024 15:52 UTC
References: 1 2 3 4 5

Steve Hayes wrote on Fri, 19 Jul 2024 11:35:06 +0200 :

>>I recently had to sort out an XP machine with some 500 wrongly named &
>>corrupted files that contained photos.
>>I was pleasantly surprised at the number of different types of file that
>>Irfanview would open, play & sort out the correct extension. Saved me
>>hundreds of clicks & hours of work.
>
> I find Irfanview very useful for all kinds of graphics tasks.

I love that the Irfanview batch command can modify a set of images to
obfuscate fingerprinting (which is important as I upload many images).

Image fingerprinting keeps getting better by the day; it's already
capable of tracing two unrelated images on the net to the exact same camera.

Subject: Re: OCR on Windows
From: Mr. Man-wai Chang
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: A noiseless patient Spider
Date: Fri, 19 Jul 2024 16:30 UTC
References: 1
Path: eternal-september.org!news.eternal-september.org!toylet.eternal-september.org!.POSTED!not-for-mail
From: toylet.toylet@gmail.com (Mr. Man-wai Chang)
Newsgroups: alt.comp.os.windows-10,alt.comp.os.windows-10,comp.text.pdf
Subject: Re: OCR on Windows
Date: Sat, 20 Jul 2024 00:30:22 +0800
Organization: A noiseless patient Spider
Lines: 8
Message-ID: <v7e4av$32gjg$1@toylet.eternal-september.org>
References: <v6v74c$80bq$1@matrix.hispagatos.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Fri, 19 Jul 2024 18:30:23 +0200 (CEST)
Injection-Info: toylet.eternal-september.org; posting-host="e5abf789fa33adbd9408b4e928c37594";
logging-data="3228272"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19JJSZDNyeBeE10NBR+IPAd"
User-Agent: Mozilla Thunderbird
Cancel-Lock: sha1:0fwlv3LfS2JEc8fklulJJZkkRKk=
Content-Language: en-US
In-Reply-To: <v6v74c$80bq$1@matrix.hispagatos.org>

On 14/7/2024 8:46 am, Bill Powell wrote:
> I have a series of one-page PDFs that are really images and not text even
> though they look like they're just a page of simple text in the same font.
>
> Is there a way to easily OCR a PDF to actual text on Windows for free?

I think the free Micro$oft OneNote app has OCR... couldn't quite
remember! ;)

Subject: Re: OCR on Windows
From: Peter Flynn
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: Usenet Labs Bozon Detector Facility
Date: Wed, 24 Jul 2024 20:18 UTC
References: 1 2
Path: eternal-september.org!news.eternal-september.org!feeder3.eternal-september.org!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail
From: peter@silmaril.ie (Peter Flynn)
Newsgroups: alt.comp.os.windows-10,alt.comp.os.windows-10,comp.text.pdf
Subject: Re: OCR on Windows
Date: Wed, 24 Jul 2024 21:18:33 +0100
Organization: Usenet Labs Bozon Detector Facility
Lines: 16
Message-ID: <lgd5spF2p6iU1@mid.individual.net>
References: <v6v74c$80bq$1@matrix.hispagatos.org>
<k1c69jdmh1hj59pc5nbdaiefs8aak31u84@4ax.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
X-Trace: individual.net UQ4a5yNY5zrKBWypIXO0EAEjbKgVowf0fkEZY5AIYpQZeiDaVC
Cancel-Lock: sha1:yLlq4FcBogIF2yHjiKuLNpmxYGw= sha256:u+zbwPYCPTgdEGhVbLMbCe7nJFUjhDod1IZWk9ZjVsY=
User-Agent: Mozilla Thunderbird
Content-Language: en-GB
In-Reply-To: <k1c69jdmh1hj59pc5nbdaiefs8aak31u84@4ax.com>

On 14/07/2024 02:57, micky wrote:
> In alt.comp.os.windows-10, on Sun, 14 Jul 2024 02:46:04 +0200, Bill
> Powell <bill@anarchists.org> wrote:
>
>> I have a series of one-page PDFs that are really images and not text even
>> though they look like they're just a page of simple text in the same font.
>>
>> Is there a way to easily OCR a PDF to actual text on Windows for free?
>
> Aren't there lots of websites that do this? But you have to upload the
> file. I've resisted that but would be really happy if I could do it
> inside my computer.

Is tesseract not available on Windows?

P

Subject: Re: OCR on Windows
From: Paul
Newsgroups: alt.comp.os.windows-10, alt.comp.os.windows-10, comp.text.pdf
Organization: A noiseless patient Spider
Date: Wed, 24 Jul 2024 23:29 UTC
References: 1 2 3
Path: eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: nospam@needed.invalid (Paul)
Newsgroups: alt.comp.os.windows-10,alt.comp.os.windows-10,comp.text.pdf
Subject: Re: OCR on Windows
Date: Wed, 24 Jul 2024 19:29:31 -0400
Organization: A noiseless patient Spider
Lines: 37
Message-ID: <v7s2os$1uo9a$1@dont-email.me>
References: <v6v74c$80bq$1@matrix.hispagatos.org>
<k1c69jdmh1hj59pc5nbdaiefs8aak31u84@4ax.com>
<lgd5spF2p6iU1@mid.individual.net>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
Injection-Date: Thu, 25 Jul 2024 01:29:32 +0200 (CEST)
Injection-Info: dont-email.me; posting-host="6421a8cc91a5e7e58ba8bc1a1107119a";
logging-data="2056490"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18zRoYC8ejIYE8sUVaJQWVi8RWKvRbdwA0="
User-Agent: Ratcatcher/2.0.0.25 (Windows/20130802)
Cancel-Lock: sha1:XqMhfdVYZC3QADRPLdzQhXbHhhQ=
Content-Language: en-US
In-Reply-To: <lgd5spF2p6iU1@mid.individual.net>

On 7/24/2024 4:18 PM, Peter Flynn wrote:
> On 14/07/2024 02:57, micky wrote:
>> In alt.comp.os.windows-10, on Sun, 14 Jul 2024 02:46:04 +0200, Bill
>> Powell <bill@anarchists.org> wrote:
>>
>>> I have a series of one-page PDFs that are really images and not text even
>>> though they look like they're just a page of simple text in the same font.
>>>
>>> Is there a way to easily OCR a PDF to actual text on Windows for free?
>>
>> Aren't there lots of websites that do this? But you have to upload the
>> file.  I've resisted that but would be really happy if I could do it
>> inside my computer.
>
> Is tesseract not available on Windows?
>
> P

https://github.com/UB-Mannheim/tesseract/wiki

https://github.com/UB-Mannheim/tesseract/releases/download/v5.4.0.20240606/tesseract-ocr-w64-setup-5.4.0.20240606.exe

https://github.com/UB-Mannheim/tesseract/wiki/Install-additional-language-and-script-models

https://tesseract-ocr.github.io/tessdoc/Data-Files

The English file (training data), as an example, is 14.7 MB.

*******
tesseract-ocr-w64-setup-5.4.0.20240606.exe 50,175,248 bytes

https://www.virustotal.com/gui/file/c885fff6998e0608ba4bb8ab51436e1c6775c2bafc2559a19b423e18678b60c9

Haven't tested that.

Paul
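
For anyone who installs the Tesseract build linked above, here is a minimal, untested-on-the-poster's-files sketch of how the local OCR step could be scripted. It assumes that tesseract.exe is on PATH, that each one-page PDF has already been rendered to a PNG (as far as I know Tesseract does not read PDF input directly, so something like Irfanview or Ghostscript would be needed for that step), and that the "eng" language data is installed. The function names and the folder name are hypothetical, made up for illustration; only the basic "tesseract IMAGE OUTPUTBASE -l LANG" command form comes from the Tesseract command line.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_command(image_path, lang="eng", exe="tesseract"):
    """Return (argv, expected_text_file) for one page image.

    Hypothetical helper: it only assembles the command line
    'tesseract IMAGE OUTPUTBASE -l LANG'; it does not run anything.
    """
    image = Path(image_path)
    # Tesseract appends ".txt" to the output base by itself.
    output_base = image.with_suffix("")
    argv = [exe, str(image), str(output_base), "-l", lang]
    return argv, Path(str(output_base) + ".txt")


def ocr_folder(folder, lang="eng"):
    """OCR every PNG in a folder and return the text files written.

    Assumes the PDFs were already converted to PNG page images.
    """
    exe = shutil.which("tesseract")
    if exe is None:
        raise FileNotFoundError("tesseract is not on PATH")
    written = []
    for image in sorted(Path(folder).glob("*.png")):
        argv, text_file = build_tesseract_command(image, lang, exe)
        subprocess.run(argv, check=True)  # raises if tesseract fails
        written.append(text_file)
    return written


if __name__ == "__main__":
    # "scans" is a placeholder folder name.
    for path in ocr_folder("scans"):
        print("wrote", path)
```

Everything runs on the local machine, so nothing has to be uploaded to a website. Other languages should work by passing a different code to the lang parameter, provided the matching data file from the language-models page above is installed.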

