
httrack
HTTrack Website Copier, copy websites to your computer (Official repository)
The Lens
HTTrack copies an entire website to your hard drive so you can browse it offline. Point it at a URL and it recursively pulls down the HTML, images, and assets, rebuilding the link structure so the local mirror works exactly like the live site. It's been doing this one job well since the early 2000s, on Windows as WinHTTrack and on Linux as WebHTTrack.
There's nothing to set up beyond installing it. It resumes interrupted downloads, updates existing mirrors, and gives you filters to control how deep it crawls and what it grabs. For a tool this old, it's remarkably dependable, which is why it's still the default answer for site archiving.
Anyone who needs a local copy of a site, for offline reading, archiving before a shutdown, or grabbing documentation you can't count on staying up, gets exactly that. It's not a modern scraper for structured data extraction, reach for Scrapy or Playwright if you need to parse and transform what you pull. HTTrack mirrors; it doesn't analyze.
The catch: the web moved on. Heavy JavaScript sites that render content client-side don't mirror cleanly, because HTTrack grabs what the server sends, not what a browser builds. It shines on traditional, server-rendered sites and struggles with modern single-page apps. Know which kind you're pointing it at.
Free vs Self-Hosted vs Paid
fully freeFree tier: Free and open source under GPL-3.0. No paid version, no limits.
Self-hosted: It's a desktop app you install, not a server. Free.
Paid: None. There is no commercial tier.
Free and open source (GPL-3.0). No paid tier, no catch on cost.
Get tools like this every Wednesday
One featured tool, three on the radar. No fluff.
License: GNU General Public License v3.0
Commercial OK but must share source of modifications.
Commercial use: ✓ Yes
About
- Owner
- Xavier Roche (User)
- Stars
- 4,521
- Forks
- 757
Explore Further
More tools in the directory
openclaw
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
378.6k ★everything-claude-code
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
215.0k ★claw-code
The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
193.8k ★