A Python-based CLI for working with Hacker News.

nicksergeant · on Nov 21, 2011

I started this specifically to dump my saved stories to an XML feed, which I hook up to http://ifttt.com and auto-import into Pinboard. It's working great so far.

I know there are a bunch of HN CLIs out there, but none handle saved stories (the only thing this supports, right now).
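For anyone curious what such an XML dump might look like, here is a minimal sketch, not the tool's actual code. It assumes RSS 2.0 as the feed format (the post only says "XML feed") and a list of (title, url) pairs standing in for scraped saved stories; `build_feed` is a hypothetical name.

```python
import xml.etree.ElementTree as ET


def build_feed(stories, feed_title="HN saved stories"):
    """Return an RSS 2.0 document (as a string) for (title, url) pairs.

    RSS 2.0 is an assumption; the post only mentions "an XML feed".
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = feed_title
    ET.SubElement(channel, "link").text = "http://news.ycombinator.com/"
    ET.SubElement(channel, "description").text = "Saved stories export"
    for title, url in stories:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = title
        ET.SubElement(item, "link").text = url
    # ElementTree escapes characters such as "&" in text nodes for us.
    return ET.tostring(rss, encoding="unicode")


if __name__ == "__main__":
    # Placeholder data, not real saved stories.
    sample = [("Example story", "http://example.com/a?x=1&y=2")]
    print(build_feed(sample))
```

A feed reader or a service like ifttt would then poll the resulting file for new items.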

pyre · on Nov 21, 2011

Mine only does saved stories. Though it's in Perl, not Python. It was whipped up in response to a complaint several months ago. It just never had a Show HN.

Http://github.com/bsandrow/hn-profile/

draegtun · on Nov 21, 2011

Shame you didn't do a Show HN, because it looks interesting/promising. Thanks for pointing me to it.

In return I can give you a clickable link :) http://github.com/bsandrow/hn-profile/

ez77 · on Nov 21, 2011

Item #N I'm learning on HN today: .netrc! I read that I need a "machine remote_hostname token pair". What should I use for remote_hostname in this case? Thanks!

ez77 · on Nov 21, 2011

Self-answer, for anyone interested. It's in the source code [1]:

  die "Error: No matches for news.ycombinator.com in ~/.netrc."

[1] https://github.com/bsandrow/hn-profile/blob/master/hn-profil...
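For reference, a rough Python equivalent of that lookup using the standard-library `netrc` module. This is only a sketch, not hn-profile's code (which is Perl); the `hn_credentials` helper and the sample login and password are made up for illustration. Going by the error message, the `machine` value is `news.ycombinator.com`.

```python
import netrc
import os
import tempfile

HN_HOST = "news.ycombinator.com"  # hostname taken from the error message above


def hn_credentials(path=None):
    """Return (login, password) for HN from a netrc file.

    With path=None the netrc module reads ~/.netrc.
    """
    auth = netrc.netrc(path).authenticators(HN_HOST)
    if auth is None:
        raise LookupError("No matches for %s in netrc file" % HN_HOST)
    login, _account, password = auth
    return login, password


if __name__ == "__main__":
    # Write a throwaway netrc file with placeholder credentials.
    with tempfile.NamedTemporaryFile("w", suffix=".netrc", delete=False) as fh:
        fh.write("machine news.ycombinator.com login myuser password secret\n")
    try:
        print(hn_credentials(fh.name))
    finally:
        os.remove(fh.name)
```

So the line in ~/.netrc would take the form `machine news.ycombinator.com login <user> password <pass>`.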

dmn001 · on Nov 21, 2011

Last week I also wrote the same thing in Perl: it logs in, handles saved stories, and then adds them to a folder in Instapaper.

ez77 · on Nov 21, 2011

  in response to a complaint

What kind of complaint?

pyre · on Nov 21, 2011

Someone was complaining in the comments that there was no easy way to get the full list of saved stories back out of HN. I whipped up this script to do that in response. It had been an idea I was batting around in the past, but the comment provided the motivation to create it.

Edit: I looked up the thread, and it turns out it was an inquiry, not a complaint.

http://news.ycombinator.com/item?id=2479416

narag · on Nov 21, 2011

Thank you very much, I've been wanting to do that for a long time.

nicksergeant · on Nov 21, 2011

The next piece of this I'll be building out is a method for retrieving replies to your comments (I want an email when that happens).

pault · on Nov 21, 2011

You can also use http://hnnotify.com for this. Works great.

nicksergeant · on Nov 21, 2011

Ah, very cool - thanks!

ez77 · on Nov 21, 2011

  Retrieve all saved items [...]

This, by itself, is a wish come true for this programmer wannabe. Thank you wholeheartedly.

RyanMcGreal · on Nov 21, 2011

See also: https://github.com/scottjacksonx/hnsh

nicksergeant · on Nov 21, 2011

Saw that but it hasn't been updated in over a year. Also didn't look like there was a quick way to export saved stories as XML.

nicksergeant · on Nov 21, 2011

If anyone has any ideas as to how best to handle cookie caching, I'm all ears.

mp3jeep01 · on Nov 21, 2011

You might want to look into using something like twill (instead of requests) and BeautifulSoup (instead of pyquery). twill in particular will allow more control over cookies, etc.

nicksergeant · on Nov 21, 2011

Thought about that, but using Requests + PyQuery was what made this project a joy to work with in the first place :)

I'm planning on just writing cookie data to a file and using that if available.
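A minimal sketch of that write-to-a-file plan, assuming the cookies are kept as a plain name/value dict and stored as JSON. The `load_cookies` and `save_cookies` helpers and the cache filename are hypothetical, not part of the project.

```python
import json
import os
import tempfile


def load_cookies(path):
    """Return cached cookies as a dict, or {} if no usable cache exists."""
    try:
        with open(path) as fh:
            data = json.load(fh)
    except (OSError, ValueError):
        # Missing, unreadable, or corrupt cache: act as if there are no cookies.
        return {}
    return data if isinstance(data, dict) else {}


def save_cookies(path, cookies):
    """Write the cookie dict to path as JSON.

    On POSIX a newly created file gets mode 0600, since it holds a login token.
    """
    fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o600)
    with os.fdopen(fd, "w") as fh:
        json.dump(dict(cookies), fh)


if __name__ == "__main__":
    cache = os.path.join(tempfile.mkdtemp(), "hn_cookies.json")  # example path
    cookies = load_cookies(cache)
    if not cookies:
        # A real tool would log in here; this value is a placeholder.
        cookies = {"user": "placeholder-token"}
        save_cookies(cache, cookies)
    print(load_cookies(cache))
```

The loaded dict could then be handed to Requests through its `cookies=` argument, falling back to a fresh login when the cache is empty or the cookie has expired.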

pyre · on Nov 21, 2011

I prefer lxml to BeautifulSoup.

schrijver · on Nov 21, 2011

I can’t find the link now, but I remember something about BeautifulSoup being deprecated—it didn’t support HTML5 last time I checked. LXML is great. http://code.google.com/p/html5lib/ is also a nice parser for HTML5 documents.

Edit: pyquery wraps the aforementioned LXML. Seems like a good fit for jQuery-style selection…

pyre · on Nov 21, 2011

I tried PyQuery ~1 year ago, and immediately found issues with it (IIRC it was having trouble selecting an element that had two classes, when the selector was only specifying one of those classes). I may have to revisit that if people are recommending it with positive reviews.

nicksergeant · on Nov 22, 2011

Give it a shot again. I've used it with several projects over the last 6 months and have had no issues. One of the projects also involved some fairly heinous malformed HTML, and PyQuery performed well.
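For context on the two-class selector issue mentioned above: in CSS, a selector like `.story` should match any element whose `class` attribute contains `story` as one of its whitespace-separated tokens, so an element with two classes should be selectable by either one. The sketch below illustrates that rule with the standard library only. It is not how PyQuery or lxml implement selection, and `ClassFinder`, `find_by_class`, and the sample markup are made up for illustration.

```python
from html.parser import HTMLParser


class ClassFinder(HTMLParser):
    """Collect (tag, class list) for elements carrying a given class token."""

    def __init__(self, wanted):
        super().__init__()
        self.wanted = wanted
        self.matches = []

    def handle_starttag(self, tag, attrs):
        # The class attribute is a whitespace-separated list of tokens.
        classes = (dict(attrs).get("class") or "").split()
        if self.wanted in classes:
            self.matches.append((tag, classes))


def find_by_class(html, wanted):
    """Return every element in html that has `wanted` among its classes."""
    finder = ClassFinder(wanted)
    finder.feed(html)
    finder.close()
    return finder.matches


if __name__ == "__main__":
    page = '<p class="story saved">a</p><p class="storyline">b</p>'
    print(find_by_class(page, "story"))
```

Note that matching is by whole token, so `storyline` does not count as `story`.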

vrode · on Nov 21, 2011

This is an enormous undertaking.