A PDF can contain selectable text, images, or both. If you use the PDF function in OS X, you get a PDF with selectable text (unless you are making a PDF
of an image). To check, open the PDF in Preview and try to select the text with the text tool.
If the PDF already contains text, you can grab the text portion of the PDF using the 360Works Scribe plugin.
If the text grabbed from the PDF has a logical order, you could use a looping script to split the text into records.
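To illustrate the looping idea outside FileMaker, here is a minimal Python sketch. It assumes (purely for illustration) that each record in the extracted text is a block of lines separated by a blank line, with one field per line; the function name and field names are hypothetical, and your PDF text may well follow a different pattern.

```python
def split_into_records(text, field_names):
    """Split blank-line-separated blocks of text into one dict per record.

    Assumption: every non-empty line is one field, in the order given
    by field_names, and a blank line ends the current record.
    """
    records = []
    current = []
    for line in text.splitlines():
        line = line.strip()
        if line:
            # Still inside a record: collect the field value.
            current.append(line)
        elif current:
            # Blank line: close the record collected so far.
            records.append(dict(zip(field_names, current)))
            current = []
    if current:
        # The text did not end with a blank line; keep the last record.
        records.append(dict(zip(field_names, current)))
    return records


sample = "Jane Doe\n12 Main St\n\nJohn Roe\n34 Oak Ave\n"
for record in split_into_records(sample, ["name", "address"]):
    print(record)
```

A FileMaker looping script would follow the same shape: walk the text line by line, fill the fields, and create a new record whenever the separator shows up.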
If the PDF is an image (because you ran a document through a scanner, for example), you can use ABBYY FineReader to read the text and then use the steps above to split it into separate records. OCR text is less regular and more error-prone than text incorporated in a PDF.
There are several ways to grab text from the web. You can grab the source of a web viewer or use a plugin to get the source of a web URL.
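Once you have the page source, the text still has to be separated from the markup. As a rough sketch of that step, the Python below uses only the standard library's `html.parser` to collect the visible text chunks from an HTML string. The class and function names are my own, and the sample HTML is made up; a real membership page will need its own handling (and fetching a page behind a login is a separate problem this does not address).

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not inside script/style.
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def visible_text(html_source):
    """Return the visible text chunks of an HTML string, in document order."""
    parser = TextExtractor()
    parser.feed(html_source)
    parser.close()
    return parser.chunks


page = (
    "<html><head><style>p{}</style></head><body>"
    "<table><tr><td>Jane Doe</td><td>Member</td></tr></table>"
    "<script>var x = 1;</script></body></html>"
)
print(visible_text(page))
```

The resulting list of chunks can then be split into records in the same way as text grabbed from a PDF.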
Hope that helps,
Ruben van den Boogaard
Eric Cunningham wrote, in part:
The second task we have is similar, but I don’t think OCR software will work. We have access to web-based membership lists that we also want to match and import into our database. They can’t be downloaded, and printing as PDF leads to the same problem as above.
I have worked with some web-based systems where the reports are also available as comma-separated value files (.CSV). These can be imported into FileMaker without a hitch. Try contacting your web source support to see if CSVs can be made available to you.