Turn websites into LLM-ready data
Power your AI apps with clean data crawled from any website. It's also open-source.
Crawl, Scrape, Clean
We crawl all accessible subpages and give you clean markdown for each. No sitemap required.
[
  {
    "url": "https://www.firecrawl.dev/",
    "markdown": "## Welcome to Firecrawl Firecrawl is a web scraper that allows you to extract the content of a webpage."
  },
  {
    "url": "https://www.firecrawl.dev/features",
    "markdown": "## Features Discover how Firecrawl's cutting-edge features can transform your data operations."
  },
  {
    "url": "https://www.firecrawl.dev/pricing",
    "markdown": "## Pricing Plans Choose the perfect plan that fits your needs."
  },
  {
    "url": "https://www.firecrawl.dev/about",
    "markdown": "## About Us Learn more about Firecrawl's mission and the team behind our innovative platform."
  }
]
Note: The markdown has been edited for display purposes.
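As a rough illustration of how the crawl output above might be consumed, the sketch below joins the per-page markdown into a single prompt-ready string. The helper name `pagesToContext` and the character budget are hypothetical and are not part of Firecrawl; the only thing taken from this page is the `{ url, markdown }` shape of each entry.

```javascript
// Hypothetical helper: turn a crawl result array into one prompt-ready string.
// Assumes each entry has the { url, markdown } shape shown in the sample above.
function pagesToContext(pages, maxChars = 8000) {
  const sections = [];
  let used = 0;
  for (const page of pages) {
    // Skip entries that came back without markdown.
    if (!page || typeof page.markdown !== "string" || page.markdown.length === 0) continue;
    const section = `Source: ${page.url}\n${page.markdown.trim()}`;
    // Stop before exceeding the character budget (a crude stand-in for a token budget;
    // separators between sections are not counted).
    if (used + section.length > maxChars) break;
    sections.push(section);
    used += section.length;
  }
  return sections.join("\n\n---\n\n");
}

const sample = [
  { url: "https://www.firecrawl.dev/", markdown: "## Welcome to Firecrawl" },
  { url: "https://www.firecrawl.dev/pricing", markdown: "## Pricing Plans" },
];
console.log(pagesToContext(sample));
```

Keeping the source URL next to each section lets a downstream model cite where a passage came from.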
Integrate today
Enhance your applications with top-tier web scraping and crawling capabilities.
// npm install @mendable/firecrawl-js
// Top-level await requires an ES module (.mjs or "type": "module").
import FirecrawlApp from '@mendable/firecrawl-js';

const app = new FirecrawlApp({ apiKey: "fc-YOUR_API_KEY" });

// Scrape a website:
const scrapeResult = await app.scrapeUrl('firecrawl.dev');

// Print the page's markdown only if the scrape succeeded.
if (scrapeResult.success) {
  console.log(scrapeResult.markdown);
}
Start for free, scale easily
Kick off your journey for free and scale seamlessly as your project expands.
Open-source
We handle the hard stuff
Rotating proxies, orchestration, rate limits, JS-blocked content, and more.
Crawling
Firecrawl crawls all accessible subpages, even without a sitemap.
Dynamic content
Firecrawl gathers data even if a website uses JavaScript to render content.
To Markdown
Firecrawl returns clean, well-formatted markdown, ready for use in LLM applications.
Reliability first
Reliability is our core focus. Firecrawl is designed to ensure you get all the data you need.
No Caching
Firecrawl doesn't cache content by default. You always get the latest data.
Built for AI
Built by LLM engineers, for LLM engineers. Giving you clean data the way you want it.
Smart Wait
Firecrawl can intelligently wait for content to load, making scraping faster and more reliable.
Actions
Click, scroll, write, wait, press and more before extracting content.
Media Parsing
Firecrawl can parse and output clean content from web-hosted PDFs, DOCX files, images, and more.
Our wall of love
Don't take our word for it
Flexible Pricing
Start for free, then scale as you grow
Free Plan
500 credits
$0 one-time
No credit card required
- Scrape 500 pages
- 10 /scrape per min
- 1 /crawl per min
Hobby
3,000 credits per month
$16/month
$190/yr, reduced from $228/yr (Billed annually)
- Scrape 3,000 pages*
- 20 /scrape per min
- 3 /crawl per min
Standard (Most Popular)
100,000 credits per month
$83/month
$990/yr, reduced from $1188/yr (Billed annually)
- Scrape 100,000 pages*
- 100 /scrape per min
- 10 /crawl per min
- 3 seats
Growth
500,000 credits per month
$333/month
$3990/yr, reduced from $4788/yr (Billed annually)
- Scrape 500,000 pages*
- 1000 /scrape per min
- 50 /crawl per min
- 5 seats
- Priority Support
Add-ons
Auto Recharge Credits
Automatically recharge your credits when you run low.
Subscribe to a plan to enable auto recharge
Credit Pack
Purchase a pack of additional monthly credits.
Subscribe to a plan to purchase credit packs
Enterprise Plan
Unlimited credits. Custom RPMs.
- Top priority support
- Feature Acceleration
- SLAs
- Account Manager
- Custom rate limits volume
- Custom concurrency limits
- Custom seats
- CEO's number
* a /scrape refers to the scrape API endpoint. Structured extraction costs vary. See credits table.
* a /crawl refers to the crawl API endpoint.
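To make the plan arithmetic concrete, here is a small sketch that picks the smallest listed monthly plan for a given page volume and converts a per-minute limit into a minimum delay between calls. The helper names `smallestPlanFor` and `minDelayMs` are hypothetical; the plan figures are copied from the list above, and the sketch assumes 1 credit per page, which does not hold for structured extraction.

```javascript
// Plan data copied from the pricing list above. The Free plan's 500 credits are
// one-time, so it is left out of this monthly comparison.
const PLANS = [
  { name: "Hobby", creditsPerMonth: 3000, usdPerMonth: 16 },
  { name: "Standard", creditsPerMonth: 100000, usdPerMonth: 83 },
  { name: "Growth", creditsPerMonth: 500000, usdPerMonth: 333 },
];

// Hypothetical helper: smallest listed plan whose monthly credits cover the workload.
// Assumes 1 credit per page (plain scraping or crawling).
function smallestPlanFor(pagesPerMonth) {
  const plan = PLANS.find((p) => p.creditsPerMonth >= pagesPerMonth);
  return plan ? plan.name : "Enterprise";
}

// Minimum spacing between calls needed to stay under a per-minute limit.
function minDelayMs(requestsPerMinute) {
  return Math.ceil(60000 / requestsPerMinute);
}

console.log(smallestPlanFor(2500));   // Hobby
console.log(smallestPlanFor(120000)); // Growth
console.log(minDelayMs(20));          // 3000 (Hobby: 20 /scrape per min)
```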
FAQ
Frequently asked questions about Firecrawl
General
What is Firecrawl?
Firecrawl turns entire websites into clean, LLM-ready markdown or structured data. Scrape, crawl and extract the web with a single API. Ideal for AI companies looking to empower their LLM applications with web data.
What sites work?
Firecrawl is best suited for business websites, docs and help centers. We currently don't support social media platforms.
Who can benefit from using Firecrawl?
Firecrawl is tailored for LLM engineers, data scientists, AI researchers, and developers looking to harness web data for training machine learning models, market research, content aggregation, and more. It simplifies the data preparation process, allowing professionals to focus on insights and model development.
Is Firecrawl open-source?
Yes, it is. You can check out the repository on GitHub. Keep in mind that the repository is still in the early stages of development, and we are in the process of merging custom modules into the monorepo.
What is the difference between Firecrawl and other web scrapers?
Firecrawl is designed with reliability and AI-ready data in mind. We focus on delivering data reliably and in an LLM-ready format, so you can spend fewer tokens and build better AI applications.
What is the difference between the open-source version and the hosted version?
Firecrawl's hosted version features Fire-engine, our proprietary scraper that takes care of proxies, anti-bot mechanisms, and more. It is an intelligent scraper designed to get the data you need, reliably. The hosted version also supports actions (interacting with the page before scraping), includes a dashboard for analytics, and is just one API call away.
Scraping & Crawling
How does Firecrawl handle dynamic content on websites?
Unlike traditional web scrapers, Firecrawl is equipped to handle dynamic content rendered with JavaScript. It ensures comprehensive data collection from all accessible subpages, making it a reliable tool for scraping websites that rely heavily on JS for content delivery.
Why is it not crawling all the pages?
There are a few reasons why Firecrawl may not be able to crawl all the pages of a website. Common ones include rate limiting and anti-scraping mechanisms that block the crawler from accessing certain pages. If you're experiencing issues with the crawler, please reach out to our support team at help@firecrawl.com.
Can Firecrawl crawl websites without a sitemap?
Yes, Firecrawl can access and crawl all accessible subpages of a website, even in the absence of a sitemap. This feature enables users to gather data from a wide array of web sources with minimal setup.
What formats can Firecrawl convert web data into?
Firecrawl specializes in converting web data into clean, well-formatted markdown. This format is particularly suited for LLM applications, offering a structured yet flexible way to represent web content.
How does Firecrawl ensure the cleanliness of the data?
Firecrawl employs advanced algorithms to clean and structure the scraped data, removing unnecessary elements and formatting the content into readable markdown. This process ensures that the data is ready for use in LLM applications without further preprocessing.
Is Firecrawl suitable for large-scale data scraping projects?
Absolutely. Firecrawl offers various pricing plans, up to an Enterprise plan with unlimited credits that supports scraping of millions of pages. With features like caching and scheduled syncs, it's designed to efficiently handle large-scale data scraping and continuous updates, making it ideal for enterprises and large projects.
Does it respect robots.txt?
Yes, the Firecrawl crawler respects the rules set in a website's robots.txt file. If you notice any issues with the way Firecrawl interacts with your website, you can adjust the robots.txt file to control the crawler's behavior. The Firecrawl user agent name is 'FirecrawlAgent'. If you notice any unexpected behavior, please let us know at help@firecrawl.com.
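For site owners who want to sanity-check their rules, the sketch below shows what a robots.txt group targeting 'FirecrawlAgent' might look like and how a path could be tested against it. This is a deliberately minimal illustration, not Firecrawl's own matching logic: the function `isDisallowed` is hypothetical, and it only understands `User-agent` and `Disallow` lines with plain path prefixes.

```javascript
// Minimal robots.txt check for one user agent. A sketch only: it handles
// User-agent and Disallow lines with plain path prefixes, and ignores Allow,
// wildcards, and the "*" fallback group.
function isDisallowed(robotsTxt, userAgent, path) {
  let groupAgents = [];      // agents named by the current group
  let readingAgents = false; // true while consecutive User-agent lines continue
  for (const rawLine of robotsTxt.split(/\r?\n/)) {
    const line = rawLine.split("#")[0].trim(); // drop comments
    const sep = line.indexOf(":");
    if (sep === -1) continue;
    const field = line.slice(0, sep).trim().toLowerCase();
    const value = line.slice(sep + 1).trim();
    if (field === "user-agent") {
      if (!readingAgents) groupAgents = []; // a new group starts
      groupAgents.push(value.toLowerCase());
      readingAgents = true;
    } else {
      readingAgents = false;
      if (field === "disallow" && value !== "" &&
          groupAgents.includes(userAgent.toLowerCase()) &&
          path.startsWith(value)) {
        return true;
      }
    }
  }
  return false;
}

// Example rules: keep FirecrawlAgent out of /internal/, allow everything else.
const robots = [
  "User-agent: FirecrawlAgent",
  "Disallow: /internal/",
  "",
  "User-agent: *",
  "Disallow:",
].join("\n");

console.log(isDisallowed(robots, "FirecrawlAgent", "/internal/report")); // true
console.log(isDisallowed(robots, "FirecrawlAgent", "/blog/post"));       // false
```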
What measures does Firecrawl take to handle web scraping challenges like rate limits and caching?
Firecrawl is built to navigate common web scraping challenges using stealth proxies, rate-limit handling, and smart wait. It manages requests and employs techniques to minimize bandwidth usage and avoid triggering anti-scraping mechanisms, ensuring reliable data collection.
Does Firecrawl handle CAPTCHAs or authentication?
Firecrawl avoids CAPTCHAs by using stealth proxies. When it encounters one, it attempts to solve it automatically, but this is not always possible. We are working to add support for more CAPTCHA-solving methods. For authentication, you can pass auth headers to the API.
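As an illustration of passing auth headers, the sketch below builds a scrape request that forwards a session cookie to the target site. Several details here are assumptions not confirmed by this page: the endpoint URL, the `headers` and `formats` body fields, and Bearer-token auth for the API key. The helper `buildScrapeRequest` is hypothetical, so check the API reference before relying on any of these names.

```javascript
// Sketch of a /scrape request that forwards auth headers to the target site.
// Assumptions (not confirmed by this page): the endpoint URL, the "headers"
// and "formats" body fields, and Bearer-token auth for the API key.
function buildScrapeRequest(apiKey, targetUrl, siteHeaders) {
  return {
    endpoint: "https://api.firecrawl.dev/v1/scrape", // assumed URL
    options: {
      method: "POST",
      headers: {
        "Authorization": `Bearer ${apiKey}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({
        url: targetUrl,
        formats: ["markdown"],
        headers: siteHeaders, // intended for the request to the target page
      }),
    },
  };
}

const req = buildScrapeRequest("fc-YOUR_API_KEY", "https://example.com/account", {
  Cookie: "session=YOUR_SESSION_COOKIE",
});
console.log(JSON.parse(req.options.body).headers);
// To send it (Node 18+): const res = await fetch(req.endpoint, req.options);
```

Building the request separately from sending it keeps the payload easy to inspect and log before any credits are spent.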
API Related
Where can I find my API key?
When logged in, click the dashboard button in the top navigation menu. You will find your API key on the main screen and under API Keys.
Billing
Is Firecrawl free?
Firecrawl is free for the first 500 scraped pages (500 free credits). After that, you can upgrade to our Standard or Growth plans for more credits and higher rate limits.
Is there a pay-per-use plan instead of monthly?
We currently do not offer a pay-per-use plan. Instead, you can upgrade to our Standard or Growth plans for more credits and higher rate limits.
How many credits do scraping, crawling, and extraction cost?
Scraping costs 1 credit per page. Crawling costs 1 credit per page. Check out the credits table on the pricing page for more details.
Do you charge for failed requests (scrape, crawl, extract)?
We do not charge for any failed requests (scrape, crawl, extract). Please contact support at help@firecrawl.com if you notice something wrong.
What payment methods do you accept?
We accept payments through Stripe, which accepts most major credit cards, debit cards, and PayPal.