Extracting text from PDF

Discussion in 'Mac Apps and Mac App Store' started by Dale Cooper, Sep 22, 2005.

  1. Dale Cooper macrumors regular

    Dale Cooper

    Joined:
    Sep 20, 2005
    #1
    Is there a simple way to copy text from a .pdf file (in preview or Acrobat Reader), so that i can paste it into a .pages document and edit it? I'm able to copy text from one page at the time, but with longer text it will take forever. Help me, please!
     
  2. Blue Velvet Moderator emeritus

    Joined:
    Jul 4, 2004
    #2
    It is indeed an honor and a privilege to reply to another Lynch fan's query although my 2 cents worth may disappoint. :)

    To extract the text as a whole, you will need Acrobat Pro or possibly Standard, I'm afraid. Which costs money...

    Acrobat Pro gives you the option of saving the document as a Word file or a number of different universal formats e.g. HTML etc.

    BTW: Twin Peaks Season 2 DVD should be coming soon...

    Edit: Saying that, if you have access to a PC you may be able to download a fully functional demo (tryout) of Acrobat Pro for Windows and try that. For some wonderful reason, Adobe have resisted making the demo of this software available for Macs. :(

    116mb download
    http://www.adobe.com/products/tryadobe/main.jsp#product=1
     
  3. Dale Cooper thread starter macrumors regular

    Dale Cooper

    Joined:
    Sep 20, 2005
    #3
    It is indeed an honor and a privilege to get a reply from another Lynch fan. I'm really looking forward to the DVD release. I need something to look forward to, cause it's seems I have a lot of copying and pasting in front of me...
     
  4. Dale Cooper thread starter macrumors regular

    Dale Cooper

    Joined:
    Sep 20, 2005
  5. Blue Velvet Moderator emeritus

    Joined:
    Jul 4, 2004
    #7
    No worries... if any of these work, you may have some time spare to grab a damned fine cup of coffee and slice of cherry pie. ;)
     
  6. wordmunger macrumors 603

    wordmunger

    Joined:
    Sep 3, 2003
    Location:
    North Carolina
    #8
    Personally I think I'd go for a Pabst Blue Ribbon.

    Oh, was this thread about PDFs? Sorry, I have nothing useful to say.
     
  7. iSaint macrumors 603

    iSaint

    Joined:
    May 26, 2004
    Location:
    South Mississippi y'all, near the water!
    #9
    If you're reading the document via Adobe Acrobat Reader 7.x, you can choose a select text cursor option. You can highlight>copy>paste into your new document. Or you can select all: Apple + Q and copy>paste. I do this often.

    EDIT: it must be karma, I'm a Lynch fan, too!
     
  8. Mitthrawnuruodo Moderator emeritus

    Mitthrawnuruodo

    Joined:
    Mar 10, 2004
    Location:
    Bergen, Norway
    #10
    You can extract text using Preview, too. Just use the text tool (the 'A' tool mode)... unless the document is password protected, of course...
     

    Attached Files:

  9. iSaint macrumors 603

    iSaint

    Joined:
    May 26, 2004
    Location:
    South Mississippi y'all, near the water!
    #11
    yep, yep...thanks M!
     
  10. Applespider macrumors G4

    Applespider

    Joined:
    Jan 20, 2004
    Location:
    looking through rose-tinted spectacles...
    #12
    Minor OT here - I've been to the cafe in Snoqualmie where Agent Cooper ate his cherry pie and I can confirm it is just as damned fine as the coffee... :)

    Back on topic... but I'm at my PC. Isn't there something in Automator that can strip text or images out of PDFs?
     
  11. Blue Velvet Moderator emeritus

    Joined:
    Jul 4, 2004
    #13
    Yes, but only one page at a time. What if you have a 400 page document that you need correctly-formatted text from, which is the kind of process the original OP was referring to?
     
  12. Mitthrawnuruodo Moderator emeritus

    Mitthrawnuruodo

    Joined:
    Mar 10, 2004
    Location:
    Bergen, Norway
    #14
    Hmmm... cmd-A to mark all text in a multi page pdf file works for me... :confused:
     
  13. wordmunger macrumors 603

    wordmunger

    Joined:
    Sep 3, 2003
    Location:
    North Carolina
    #15
    That works in Acrobat reader, but not in Preview (for me, in Jaguar).
     
  14. erickkoch macrumors 6502a

    Joined:
    Jan 13, 2003
    Location:
    Kalifornia
    #16
    If you have larger documents and with mostly text and pictures, you could try ReadIris. It's an OCR that will electronically read a document and retype it into another application like Word. It even keeps the formatting intact. It's kind of buggy, but it works quite well for basic documents and will convert text that is from pdf, jpg, tiff, etc. Anything it can read. You can also use a scanner to copy a page and it can convert it to a doc you can edit.

    Download the free trial and try it. One thing that bugs me about the program is that it's limited to 50 pages, so if you have a larger document you'll have to break it down to 50 pages or less with PDFLab or similar utility, then combine them again in Word. The cost is $129 but after the trial they sent me an e-mail to "upgrade" to the full version for about half price for a limited time.
     
  15. Mitthrawnuruodo Moderator emeritus

    Mitthrawnuruodo

    Joined:
    Mar 10, 2004
    Location:
    Bergen, Norway
    #17
    I see. It works in Preview 3 (which came with Tiger) but apparently not in Preview 2 (the older Panther one, which also works in Tiger ;)).
     
  16. Blue Velvet Moderator emeritus

    Joined:
    Jul 4, 2004
    #18
    And is there a paragraph return at the end of each line, perhaps?
     
  17. Mitthrawnuruodo Moderator emeritus

    Mitthrawnuruodo

    Joined:
    Mar 10, 2004
    Location:
    Bergen, Norway
    #19
    Probably... but that's not the worst, if there's a header and/or footer that too will be copied... ;)

    But, still nothing one or two "find and replace" actions in a good text editor will not fix. :)
     
  18. Julsy macrumors newbie

    Joined:
    Oct 31, 2013
    #20
    hey im also looking for a way to do this same exact thing
    im pretty sure there is a decent way, but so far ive only found http://pdftoword.pro/ which kind of jumbles things up, its still ok i guess if you have the time to sort everything out :/

    does anyone know any flawless converters?
    regards
     

Share This Page