Use Automator and AppleScript to get the New York Times for free

Discussion in 'Mac Programming' started by austinbond, Sep 5, 2007.

  1. austinbond macrumors member

    Joined:
    Feb 14, 2004
    #1
    Hey guys,
    I found this site: http://www.nytimes.com/pages/todayspaper/index.html
    where the NYT publishes ever article from that day's paper on the same page. If one could use some Automator and possible AppleScript, on could legally download all of the articles from the site, and skip out of paying the $600 a year. This is again legal because the NYT has "Print This" links on all of its articles.

    I'll explain what I've done so far in hopes that someone can use it to finish this project. First I opened Automator and I used these actions:

    Get Specified URLS...with the url supplies above >>> Get link URLs from Webpages >>> Get text from Webpages

    I got an error that looked like this:
    sh: -c: line 1: unexpected EOF while looking for matching `''
    sh: -c: line 2: syntax error: unexpected end of file (2)

    I asked a friend and he mentioned I would need AppleScript which I know nothing about. Please if anyone could help me build this workflow I would appreciate it very much.

    My hope is to have the final product universal where I won't need to tweak at all everyday when I run it, and that all the pages are just the text of the articles to save on space and distractions. Then I hope to print, or read it directly off my new MacBook.
     
  2. austinbond thread starter macrumors member

    Joined:
    Feb 14, 2004
  3. MongoTheGeek macrumors 68040

    MongoTheGeek

    Joined:
    Sep 13, 2003
    Location:
    Its not so much where you are as when you are.
    #3
    Possible and easy.

    I've done essentially that in Obj-C before. I've done similar in AppleScript.

    There are commercial products out there that scrape websites as well.
     
  4. austinbond thread starter macrumors member

    Joined:
    Feb 14, 2004
    #4
    Thanks for replying! I'm glad to hear my theatrical ideas aren't just that :)
    If it is easy, could someone take it up a project...I don't have any money as I spent it all on this new toy, but it would be very helpful to me and I'm sure other geeks who can't spend $600 on newspapers. I love blogs, but I need some unbiased opinions every once in a while (as unbiased as the NYT gets) and I want to know what I am reading is free of libel. Also, the NYT has a history of revealing those things that some men want covered up and I think it is important for everyone to know when they are getting swindled. I understand that I could just read it everyday directly on the site, but I would like something that could take all the messiness and put it into a PDF, for me to print out or read it directly from the display.

    If anyone is just looking to show off their abilities, please consider helping me, and possibly many others.
     
  5. bluebomberman macrumors 6502a

    bluebomberman

    Joined:
    Jan 9, 2005
    Location:
    Queens, NYC
    #5
    It would just be easier to register with an .edu email address.
     
  6. mkrishnan Moderator emeritus

    mkrishnan

    Joined:
    Jan 9, 2004
    Location:
    Grand Rapids, MI, USA
    #6
    Do you get free access to premium content if you do that?

    I just read the Times online daily. I save PDFs when I want them. But I don't subscribe to the select / premium content. Does that content run in physical print?
     
  7. mkrishnan Moderator emeritus

    mkrishnan

    Joined:
    Jan 9, 2004
    Location:
    Grand Rapids, MI, USA
  8. austinbond thread starter macrumors member

    Joined:
    Feb 14, 2004
    #9
    Yup it's a great service they offer students; just wish I had in a portable offline version that didn't cost so much.
     
  9. bluebomberman macrumors 6502a

    bluebomberman

    Joined:
    Jan 9, 2005
    Location:
    Queens, NYC
    #10
    Hmm...I suppose an iPhone/iPod Touch is not what you're looking for?
     
  10. SC68Cal macrumors 68000

    Joined:
    Feb 23, 2006

Share This Page