URL Extractor
Extract & Filter URLs From Any Text Instantly
Paste any text, HTML, email, log file, or document and get a clean, filtered list of every URL inside it. This URL extractor finds all the links automatically, removes duplicates, lets you filter by domain, file type, or keyword, and exports the results as a copy-ready list, a .txt file, or a .csv spreadsheet — all in your browser, with nothing uploaded to a server.
How to Extract URLs From Text
- Paste your text into the input box on the left — articles, HTML source, email exports, server logs, or JSON all work.
- The tool scans the text instantly and lists every URL it finds in the output panel on the right, with live counts for total, unique, and filtered results.
- Narrow the list using the filter pills (HTTPS only, images, documents, social links, and more) or the advanced domain/keyword/exclude fields.
- Turn on “No duplicates,” “Sort A–Z,” or “Strip params” to clean up the list further.
- Copy the results, copy just the unique domains, or download the list as a
.txtor.csvfile.
Filter URLs by Type, Not Just by Eye
Most online URL extractors hand you one long, unsorted list and leave the sorting to you. This one filters as you go:
- HTTPS / HTTP only — isolate secure links or flag insecure ones during a security or migration audit.
- Images — pull every
.jpg,.png,.svg,.webp, or.giflink out of a page’s source. - Documents — grab PDF, Word, Excel, CSV, and text file links in one pass.
- Media — find video and audio file links (
.mp4,.mp3,.webm, and more). - Social — collect every link pointing to Twitter/X, Facebook, Instagram, LinkedIn, TikTok, YouTube, and other social platforms.
- API — surface endpoint URLs containing
/api/,/v1/,graphql, or similar patterns. - Has params / No params — split URLs by whether they carry a query string.
- Short URLs — flag links shortened through Bitly, TinyURL, t.co, and similar services.
- CDN — isolate asset links served from Cloudflare, Cloudfront, jsDelivr, unpkg, and other CDNs.
- Mail — pull out webmail and
/mail/links specifically.
Search and Exclude by Domain or Keyword
Beyond the category filters, three advanced fields let you narrow results further: show only URLs from a specific domain, show only URLs containing a keyword, or exclude a domain entirely. These three filters combine with the category pills, so you can, for example, pull every PDF link from a single domain in one step.
Clean Up the List Automatically
- No duplicates removes repeated URLs (case-insensitive) while keeping the first occurrence, so the order of the original text is preserved.
- Sort A–Z alphabetizes the final list, useful when comparing two extracts or scanning for a specific link.
- Strip params trims everything after the
?or#, collapsing tracked links likepage.html?utm_source=xdown to the bare URL — handy when you only care about the destination, not the tracking tags.
Export Your Results
Once the list is filtered the way you want it, copy the whole list with one click, copy just the unique domains (useful for a quick competitor or backlink scan), or download the results as a .txt file for plain text use or a .csv file with URL and domain columns for spreadsheets and reporting tools.
Who Uses a URL Extractor?
- SEO professionals pulling outbound links from a page’s HTML to audit link structure or check for broken/insecure links.
- Marketers compiling every link from a newsletter or campaign HTML to verify destinations before sending.
- Researchers and students gathering source links scattered across articles, PDFs, or reference lists for a bibliography.
- Developers and QA testers extracting links from logs or test fixtures to check formatting or flag unexpected domains.
- Support and ops teams scanning server or chat logs for the URLs users actually clicked or shared.
Why Use This URL Extractor
Everything runs locally in your browser — your text is never uploaded or stored, so it’s safe to paste sensitive logs, internal documents, or unpublished content. There’s no signup, no rate limit, and no ads interrupting the workflow. And unlike most free extractors that return one undifferentiated list, the category filters mean you usually don’t need a second tool to sort the results afterward.
FAQ
What is a URL extractor?
A URL extractor is a tool that automatically scans a block of text and pulls out every web address it contains, saving you from manually searching through the content and copying links one by one.
How do I extract URLs from a block of text?
Paste the text into the input box and the tool extracts every URL automatically — there’s no button to click or settings required to get a basic list, though filters and toggles are available if you want to narrow the results.
Can it extract URLs from HTML source code?
Yes. Paste raw HTML — including href attributes, src attributes, or inline script content — and the tool finds every valid URL inside it, the same way it would in plain text.
Does it remove duplicate URLs automatically?
Yes, the “No duplicates” toggle is on by default and removes repeated URLs while keeping the first occurrence, though you can turn it off if you want to see every instance, including repeats.
Can I extract only image links, PDF links, or social media links?
Yes. Use the category filter pills to show only images, documents, media files, social links, API endpoints, short URLs, CDN assets, or mail links — or combine a category filter with the domain and keyword fields for a more specific result.
Is my text uploaded or stored anywhere?
No. The extraction happens directly in your browser; nothing you paste is sent to a server or saved, which makes it safe to use with internal documents, logs, or unpublished content.
Can I export the extracted URLs?
Yes. You can copy the full list, copy just the unique domains, or download the results as a .txt or .csv file.
Is this URL extractor free to use?
Yes, it’s completely free with no signup, login, or usage limit.
What’s the difference between this and a website link crawler?
This tool extracts URLs from text you paste in — articles, HTML, emails, logs, and similar content. It does not crawl a live website or follow links across pages; for that, you’d need a separate site-crawling tool. If your text already contains a page’s full HTML source, though, this tool will still pull every link out of it.