Comment by rovr138
10 hours ago
Here's an example, https://shottr.cc/s/165K/SCR-20260131-oys.png
Claude Code v2.1.29
Opus 4.5 · Claude Team
~/stow/tmp
I have a new project
In it, I want to scrape hacker news data.
The project will be built in python.
I want to search first what are the latest projects in python that could help me extract the data. This could be via api or scraping it.
Once you have a plan, ask me 10 questions to help navigate the structure.
I'll help you research the latest Python options for extracting Hacker News data. Let me search for current tools and approaches.
Web Search("Python Hacker News API scraping library 2026")
⎿ Did 1 search in 20s
Web Search("Hacker News official API Python wrapper 2025 2026")
⎿ Did 1 search in 17s
Web Search("best Python library scrape Hacker News data extraction")
⎿ Did 1 search in 20s
Let me get more details on the official Hacker News API and some specific library documentation.
Fetch(https://github.com/HackerNews/API)
⎿ Received 373.8KB (200 OK)
Fetch(https://pypi.org/project/hackernews-python/)
⎿ Received 3KB (200 OK)
Fetch(https://github.com/avinassh/haxor)
⎿ Received 378.1KB (200 OK)
I've gathered comprehensive research on the available options. Let me write up my findings and then ask you the 10 questions.
Updated plan
⎿ /plan to preview
Research Summary
I found several solid options for extracting Hacker News data in Python:
Best Options Found:
1. Official HN API (Firebase-based) - Free, no rate limits, real-time data
- Source: https://github.com/HackerNews/API
2. haxor - Most popular Python wrapper for the official API
- Source: https://github.com/avinassh/haxor
- Install: pip install haxor
3. DIY Scraping - BeautifulSoup + Requests/httpx for data not in the API
- Source: https://www.proxiesapi.com/blog/scraping-the-hacker-news-with-python-and-beautiful.html.php
4. Commercial - https://apify.com/epctex/hackernews-scraper for hosted solutions
---
Now, here are my 10 questions to help define your project structure:
───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
← Data Source Data Types Storage Use Case Submit →
Do you want to use the official HN API (recommended, no rate limits) or scrape the website directly?
1. Official API (Recommended)
Use Firebase-based API - reliable, real-time, no rate limits
2. Web Scraping
Scrape HTML directly - more flexible but requires maintenance
3. Both
API as primary, scraping for data not available via API
4. Type something.
───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
5. Chat about this
6. Skip interview and plan immediately
Enter to select · Tab/Arrow keys to navigate · Esc to cancel
Wow, I had no idea. Thanks.
Yeah, it's pretty nifty.
It's got a couple tools built into it that it can use.