[PD] OCR with Puredata?
Jack
jack at rybn.org
Thu Jun 2 15:05:06 CEST 2011
Le mercredi 01 juin 2011 à 12:51 -0400, Tedb0t a écrit :
> > Is it just me, or it sounds like it's going to take a lot of
> > preprocessing before you can even think of feeding it to a neural
> > network ?
>
>
> Black/white thresholding and resolution reduction, that's it.
I also use grey images and it was working fine.
>
>
> > Human vision is made of a lot more layers of neurons than we can
> > hope to deal with in artificial networks.
>
>
> It sure is. Luckily you only need a few to do some basic OCR. You
> could try a grid of something like 10x10 pixels just to start, which
> would require 100 input neurons. The lower the resolution, the higher
> your error rate will be, but you can find good compromises.
I was working with a grid of 16x16 pixels and it was OK for me.
>
>
> > At what angles should characters be recognised ?
>
>
> Well, that's the beautiful thing about neural nets—it just depends on
> how you train the net. If you want the net to be able to recognize
> tilted letters, you can add tilted letters to the training sets. It
> can affect
Yep, if you train your NN with angles, deformations, etc. You should get
good results.
++
Jack
>
>
> > But making an OCR using ANN is a lot lot more work than using an OCR
> > library. Making a Pd-to-OCR-library interface is less work than
> > making an OCR abstraction library
>
>
> Agreed, strongly. OCR is a really excellent exercise in neural nets
> (look up Self-Organizing Maps or Kohonen networks), but it's a lot of
> work. It would be faster by far to set up an interface as Mathieu
> suggests. However, if you want to go through with it anyway, I'd love
> to help!
>
>
> It looks like the defacto open ocr lib is
> Tesseract: http://code.google.com/p/tesseract-ocr/ This would be
> great to have in Pd.
>
>
> Incidentally, I can't see any reason why Pd would be "bad at it,"
> since the ANN external uses the C FANN library, which is likely what
> any other library would use in the first place.
>
>
> ±tedb0t
>
>
>
> On Jun 1, 2011, at 11:49 AM, Mathieu Bouchard wrote:
>
> > On Wed, 1 Jun 2011, Jack wrote:
> >
> > > You can do this with the use of artificial neural network (for
> > > character recognition). There are externals for Pd :
> > > http://pure-data.svn.sourceforge.net/viewvc/pure-data/trunk/externals/ann/
> >
> > Is it just me, or it sounds like it's going to take a lot of
> > preprocessing before you can even think of feeding it to a neural
> > network ?
> >
> > Human vision is made of a lot more layers of neurons than we can
> > hope to deal with in artificial networks.
> >
> > At what angles should characters be recognised ?
> >
> > Which colour on which colour ?
> >
> > You better settle those things first, so that you can figure out how
> > you can reduce your data beforehand.
> >
> > But making an OCR using ANN is a lot lot more work than using an OCR
> > library. Making a Pd-to-OCR-library interface is less work than
> > making an OCR abstraction library... and it isn't necessarily
> > because Pd would be bad at it (I don't know about that). It's more
> > because it takes a lot of knowledge to make an OCR library from
> > nearly scratch.
> >
> > _______________________________________________________________________
> > | Mathieu Bouchard ---- tél: +1.514.383.3801 ---- Villeray,
> > Montréal, QC
> > _______________________________________________
> > Pd-list at iem.at mailing list
> > UNSUBSCRIBE and account-management ->
> > http://lists.puredata.info/listinfo/pd-list
> >
>
> _______________________________________________
> Pd-list at iem.at mailing list
> UNSUBSCRIBE and account-management -> http://lists.puredata.info/listinfo/pd-list
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 490 bytes
Desc: This is a digitally signed message part
URL: <http://lists.puredata.info/pipermail/pd-list/attachments/20110602/9b163abe/attachment.pgp>
More information about the Pd-list
mailing list