Summary
Website Quality Score
Performance 10.0
SEO 4.4
Security 6.5
Accessibility 5.0
Best Practices 9.2
- ⛔ Skipped URLs - 29 skipped URLs found.
- ⛔ 404 CRITICAL - 6 non-existent pages found.
- ⛔ 5 page(s) with multiple <h1> headings.
- ⛔ 10 page(s) without <h1> heading.
- ⛔ Security - 207 pages(s) with critical finding(s).
- ⚠️ The description '🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper' exceeds the allowed 10% duplicity. 92% of pages have this same description.
- ⚠️ 63 page(s) do not support Brotli compression.
- ⚠️ No WebP image found on the website.
- ⚠️ No AVIF image found on the website.
- ⚠️ 66 page(s) with skipped heading levels.
- ⚠️ 61 page(s) without form labels.
- ⚠️ 63 page(s) without aria labels.
- ⚠️ 62 page(s) without role attributes.
- ⏩ Loaded robots.txt for domain 'docs.crawl4ai.com': status code 404, size 29 kB and took 578 ms.
- ⏩ External URLs - 29 external URL(s) found.
- ⏩ Redirects - 2 redirect(s) found.
- ⏩ DNS IPv6: domain docs.crawl4ai.com does not support IPv6 (DNS server: 127.0.0.53).
- ✅ SSL/TLS certificate is valid until Jun 8 12:36:45 2026 GMT. Issued by C = US, O = Let's Encrypt, CN = E8. Subject is CN = crawl4ai.com.
- ✅ SSL/TLS certificate issued by 'C = US, O = Let's Encrypt, CN = E8'.
- ✅ Performance OK - all non-media URLs are faster than 3 seconds.
- ✅ HTTP headers - found 7 unique headers.
- ✅ All 62 unique title(s) are within the allowed 10% duplicity. Highest duplicity title has 3%.
- ✅ All pages have quoted attributes.
- ✅ All pages have inline SVGs smaller than 5120 bytes.
- ✅ All pages have inline SVGs with less than 5 duplicates.
- ✅ All pages have valid or none inline SVGs.
- ✅ All pages have DOM depth less than 30.
- ✅ All pages have clickable (interactive) phone numbers.
- ✅ All pages have valid HTML.
- ✅ All pages have image alt attributes.
- ✅ All pages have lang attribute.
- ✅ DNS IPv4 OK: domain docs.crawl4ai.com resolved to 35.163.245.47 (DNS server: 127.0.0.53).
Visited URLs
Found 71 row(s).
| URL | Status | Type | Time (s) | Size | Cache |
|---|---|---|---|---|---|
| / | 200 | HTML | 180 ms | 42 kB | ETag-only |
| /advanced/lazy-loading/ | 200 | HTML | 179 ms | 43 kB | ETag-only |
| /marketplace/ | 200 | HTML | 179 ms | 6 kB | ETag-only |
| /advanced/file-downloading/ | 200 | HTML | 179 ms | 49 kB | ETag-only |
| /advanced/network-console-capture/ | 200 | HTML | 179 ms | 62 kB | ETag-only |
| /core/simple-crawling/ | 200 | HTML | 250 ms | 54 kB | ETag-only |
| /core/self-hosting/ | 200 | HTML | 360 ms | 372 kB | ETag-only |
| /core/deep-crawling/ | 200 | HTML | 236 ms | 159 kB | ETag-only |
| /core/page-interaction/ | 200 | HTML | 224 ms | 86 kB | ETag-only |
| /extraction/clustring-strategies/ | 200 | HTML | 180 ms | 57 kB | ETag-only |
| /core/crawler-result/ | 200 | HTML | 203 ms | 80 kB | ETag-only |
| /core/markdown-generation/ | 200 | HTML | 180 ms | 98 kB | ETag-only |
| /core/content-selection/ | 200 | HTML | 180 ms | 115 kB | ETag-only |
| /advanced/adaptive-strategies/ | 200 | HTML | 179 ms | 77 kB | ETag-only |
| /blog/ | 200 | HTML | 179 ms | 36 kB | ETag-only |
| /api/arun/ | 200 | HTML | 179 ms | 65 kB | ETag-only |
| /api/strategies/ | 200 | HTML | 179 ms | 91 kB | ETag-only |
| /advanced/hooks-auth/ | 200 | HTML | 179 ms | 67 kB | ETag-only |
| /advanced/advanced-features/ | 200 | HTML | 180 ms | 95 kB | ETag-only |
| /core/c4a-script/ | 200 | HTML | 180 ms | 60 kB | ETag-only |
| /core/installation/ | 200 | HTML | 179 ms | 40 kB | ETag-only |
| /api/crawl-result/ | 200 | HTML | 180 ms | 95 kB | ETag-only |
| /advanced/proxy-security/ | 200 | HTML | 180 ms | 78 kB | ETag-only |
| /api/parameters/ | 200 | HTML | 180 ms | 105 kB | ETag-only |
| /core/browser-crawler-config/ | 200 | HTML | 180 ms | 87 kB | ETag-only |
| /core/cli/ | 200 | HTML | 179 ms | 61 kB | ETag-only |
| /advanced/session-management/ | 200 | HTML | 180 ms | 74 kB | ETag-only |
| /api/arun_many/ | 200 | HTML | 179 ms | 55 kB | ETag-only |
| /advanced/multi-url-crawling/ | 200 | HTML | 180 ms | 107 kB | ETag-only |
| /branding/ | 200 | HTML | 180 ms | 81 kB | ETag-only |
| /advanced/crawl-dispatcher/ | 200 | HTML | 179 ms | 31 kB | ETag-only |
| /extraction/llm-strategies/ | 200 | HTML | 180 ms | 73 kB | ETag-only |
| /core/ask-ai/ | 200 | HTML | 179 ms | 32 kB | ETag-only |
| /apps/llmtxt/ | 200 | HTML | 179 ms | 6 kB | ETag-only |
| /marketplace/admin/ | 200 | HTML | 179 ms | 10 kB | ETag-only |
| /core/url-seeding/ | 200 | HTML | 181 ms | 220 kB | ETag-only |
| /core/fit-markdown/ | 200 | HTML | 179 ms | 59 kB | ETag-only |
| /core/local-files/ | 200 | HTML | 179 ms | 60 kB | ETag-only |
| /advanced/virtual-scroll/ | 200 | HTML | 179 ms | 69 kB | ETag-only |
| /advanced/ssl-certificate/ | 200 | HTML | 179 ms | 49 kB | ETag-only |
| /extraction/no-llm-strategies/ | 200 | HTML | 181 ms | 163 kB | ETag-only |
| /extraction/chunking/ | 200 | HTML | 179 ms | 55 kB | ETag-only |
| /api/async-webcrawler/ | 200 | HTML | 180 ms | 75 kB | ETag-only |
| /core/adaptive-crawling/ | 200 | HTML | 179 ms | 69 kB | ETag-only |
| /api/c4a-script-reference/ | 200 | HTML | 180 ms | 79 kB | ETag-only |
| /advanced/anti-bot-and-fallback/ | 200 | HTML | 180 ms | 67 kB | ETag-only |
| /core/link-media/ | 200 | HTML | 180 ms | 142 kB | ETag-only |
| /apps/c4a-script/ | 200 | HTML | 179 ms | 9 kB | ETag-only |
| /CONTRIBUTING/ | 200 | HTML | 179 ms | 38 kB | ETag-only |
| /advanced/undetected-browser/ | 200 | HTML | 180 ms | 78 kB | ETag-only |
| /advanced/identity-based-crawling/ | 200 | HTML | 180 ms | 80 kB | ETag-only |
| /core/examples/ | 200 | HTML | 179 ms | 47 kB | ETag-only |
| /apps/ | 200 | HTML | 179 ms | 38 kB | ETag-only |
| /stats/ | 200 | HTML | 179 ms | 48 kB | ETag-only |
| /advanced/pdf-parsing/ | 200 | HTML | 180 ms | 65 kB | ETag-only |
| /core/cache-modes/ | 200 | HTML | 179 ms | 39 kB | ETag-only |
| /core/quickstart/ | 200 | HTML | 180 ms | 101 kB | ETag-only |
| /blog/articles/llm-context-revolution/ | 200 | HTML | 179 ms | 47 kB | ETag-only |
| /blog/articles/adaptive-crawling-revolution/ | 200 | HTML | 179 ms | 58 kB | ETag-only |
| /c4a-script/demo | 404 | HTML | 179 ms | 29 kB | ETag-only |
| /examples/c4a_script/ | 404 | HTML | 179 ms | 29 kB | ETag-only |
| /examples/c4a_script/tutorial/ | 404 | HTML | 179 ms | 29 kB | ETag-only |
| /docs/md_v2/apps/ | 404 | HTML | 179 ms | 29 kB | ETag-only |
| /docs/md_v2/assets/ | 404 | HTML | 179 ms | 29 kB | ETag-only |
| /api/parameters | 301 | Redirect | 179 ms | 147 B | None |
| /blog/articles/llm-context-revolution | 301 | Redirect | 179 ms | 191 B | None |
| /api/adaptive-crawler/ | 200 | HTML | 180 ms | 54 kB | ETag-only |
| /api/examples/c4a_script/tutorial/ | 404 | HTML | 179 ms | 29 kB | ETag-only |
| /apps/crawl4ai-assistant/ | 200 | HTML | 179 ms | 51 kB | ETag-only |
| /core/llmtxt/ | 200 | HTML | 179 ms | 31 kB | ETag-only |
| /api/digest/ | 200 | HTML | 179 ms | 49 kB | ETag-only |
| No rows found, please edit your search term. | |||||
Best practices
| Analysis name | OK | Notice | Warning | Critical |
|---|---|---|---|---|
| DOM depth (> 30) | 69 | 0 | 0 | 0 |
| Heading structure | 61 | 0 | 68 | 5 |
| Title uniqueness (> 10%) | 62 | 0 | 0 | 0 |
| Description uniqueness (> 10%) | 1 | 0 | 1 | 0 |
| Brotli support | 0 | 0 | 63 | 0 |
| WebP support | 0 | 0 | 1 | 0 |
| AVIF support | 0 | 0 | 1 | 0 |
Large inline SVGs
No problems found.
Duplicate inline SVGs
No problems found.
Invalid inline SVGs
No problems found.
Missing quotes on attributes
No problems found.
DOM depth
No problems found.
Heading structure
Found 10 row(s).
| Severity | Occurs | Detail | Affected URLs (max 5) |
|---|---|---|---|
| critical | 11 | Multiple <h1> headings found. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| critical | 10 | No <h1> tag found in the HTML content. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 47 | Heading structure is skipping levels: found an <h5> after an <h2>. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 8 | Heading structure is skipping levels: found an <h5> without a previous higher heading. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 6 | Heading structure is skipping levels: found an <h5> after an <h3>. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 3 | Heading structure is skipping levels: found an <h4> after an <h2>. | URL 1, URL 2, URL 3 |
| warning | 3 | Heading structure is skipping levels: found an <h4> after an <h1>. | URL 1, URL 2, URL 3 |
| warning | 2 | Heading structure is skipping levels: found an <h3> after an <h1>. | URL 1, URL 2 |
| warning | 2 | Heading structure is skipping levels: found an <h2> without a previous higher heading. | URL 1, URL 2 |
| warning | 2 | Heading structure is skipping levels: found an <h5> after an <h1>. | URL 1, URL 2 |
| No rows found, please edit your search term. | |||
Non-clickable phone numbers
No problems found.
Title uniqueness
No problems found.
Description uniqueness
No problems found.
Accessibility
| Analysis name | OK | Notice | Warning | Critical |
|---|---|---|---|---|
| Missing aria labels | 1 | 0 | 2980 | 14 |
| Missing html lang attribute | 1 | 0 | 0 | 0 |
| Missing form labels | 0 | 0 | 7 | 0 |
| Missing image alt attributes | 17 | 0 | 0 | 0 |
| Missing roles | 0 | 0 | 8 | 0 |
Valid HTML
No problems found.
Missing image alt attributes
No problems found.
Missing form labels
| Severity | Occurs | Detail | Affected URLs (max 5) |
|---|---|---|---|
| warning | 58 | <input class="form-*" id="mkdocs-search-query" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 1 | <input id="password" *** > | /marketplace/admin/ |
| warning | 1 | <input class="search-*" id="apps-search" *** > | /marketplace/admin/ |
| warning | 1 | <input id="edit-value" *** > | /apps/c4a-script/ |
| warning | 1 | <input id="edit-selector" *** > | /apps/c4a-script/ |
| warning | 1 | <input id="search-input" *** > | /marketplace/ |
| warning | 1 | <input class="search-*" id="articles-search" *** > | /marketplace/admin/ |
Missing aria labels
Found 186 row(s).
| Severity | Occurs | Detail | Affected URLs (max 5) |
|---|---|---|---|
| critical | 1 | <select class="filter-*" id="apps-filter"> | /marketplace/admin/ |
| critical | 1 | <input class="search-*" id="apps-search" *** > | /marketplace/admin/ |
| critical | 1 | <select id="useCase" name="useCase"> | /apps/crawl4ai-assistant/ |
| critical | 1 | <textarea id="c4a-editor" *** > | /apps/c4a-script/ |
| critical | 1 | <input id="userName" name="name" *** > | /apps/crawl4ai-assistant/ |
| critical | 1 | <input id="userEmail" name="email" *** > | /apps/crawl4ai-assistant/ |
| critical | 1 | <input id="edit-value" *** > | /apps/c4a-script/ |
| critical | 1 | <select id="edit-command-type" *** > | /apps/c4a-script/ |
| critical | 1 | <select class="mini-*" id="type-filter"> | /marketplace/ |
| critical | 1 | <input id="search-input" *** > | /marketplace/ |
| critical | 1 | <input id="userCompany" name="company" *** > | /apps/crawl4ai-assistant/ |
| critical | 1 | <input id="edit-selector" *** > | /apps/c4a-script/ |
| critical | 1 | <input id="password" *** > | /marketplace/admin/ |
| critical | 1 | <input class="search-*" id="articles-search" *** > | /marketplace/admin/ |
| critical | 1 | <select id="edit-direction"> | /apps/c4a-script/ |
| warning | 3427 | <a class="terminal-*" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 957 | <a id="__codelineno-0-***" name="__codelineno-0-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 860 | <a id="__codelineno-1-***" name="__codelineno-1-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 745 | <a ***> | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 717 | <a id="__codelineno-3-***" name="__codelineno-3-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 686 | <a id="__codelineno-4-***" name="__codelineno-4-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 637 | <a id="__codelineno-5-***" name="__codelineno-5-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 607 | <a id="__codelineno-2-***" name="__codelineno-2-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 579 | <a id="__codelineno-8-***" name="__codelineno-8-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 544 | <a id="__codelineno-6-***" name="__codelineno-6-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 400 | <a class="menu-*" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 398 | <a id="__codelineno-11-***" name="__codelineno-11-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 336 | <a id="__codelineno-7-***" name="__codelineno-7-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 299 | <a id="__codelineno-10-***" name="__codelineno-10-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 286 | <a id="__codelineno-9-***" name="__codelineno-9-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 266 | <a id="__codelineno-24-***" name="__codelineno-24-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 201 | <a id="__codelineno-13-***" name="__codelineno-13-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 196 | <a id="__codelineno-12-***" name="__codelineno-12-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 193 | <a id="__codelineno-14-***" name="__codelineno-14-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 165 | <a id="__codelineno-15-***" name="__codelineno-15-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 131 | <a id="__codelineno-16-***" name="__codelineno-16-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 116 | <a id="__codelineno-19-***" name="__codelineno-19-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 112 | <a id="__codelineno-22-***" name="__codelineno-22-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 91 | <a id="__codelineno-17-***" name="__codelineno-17-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 89 | <a id="__codelineno-18-***" name="__codelineno-18-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 82 | <a id="__codelineno-71-***" name="__codelineno-71-***" *** > | URL 1, URL 2 |
| warning | 78 | <a id="__codelineno-20-***" name="__codelineno-20-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 74 | <a id="__codelineno-21-***" name="__codelineno-21-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 69 | <a id="__codelineno-38-***" name="__codelineno-38-***" *** > | URL 1, URL 2 |
| warning | 68 | <a id="__codelineno-23-***" name="__codelineno-23-***" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 67 | <a id="__codelineno-43-***" name="__codelineno-43-***" *** > | URL 1, URL 2 |
| warning | 66 | <a id="__codelineno-44-***" name="__codelineno-44-***" *** > | URL 1, URL 2 |
| warning | 63 | <a id="__codelineno-105-***" name="__codelineno-105-***" *** > | /core/self-hosting/ |
| warning | 62 | <a id="__codelineno-57-***" name="__codelineno-57-***" *** > | URL 1, URL 2 |
| warning | 58 | <a class="no-*" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 58 | <button class="close btn btn-* btn-*" *** > | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 55 | <a id="__codelineno-32-***" name="__codelineno-32-***" *** > | URL 1, URL 2, URL 3 |
| warning | 52 | <a id="__codelineno-77-***" name="__codelineno-77-***" *** > | /core/self-hosting/ |
| warning | 50 | <a id="__codelineno-33-***" name="__codelineno-33-***" *** > | URL 1, URL 2, URL 3 |
| warning | 46 | <a id="__codelineno-26-***" name="__codelineno-26-***" *** > | URL 1, URL 2, URL 3, URL 4 |
| warning | 44 | <a id="__codelineno-72-***" name="__codelineno-72-***" *** > | URL 1, URL 2 |
| warning | 42 | <a id="__codelineno-36-***" name="__codelineno-36-***" *** > | URL 1, URL 2 |
| warning | 40 | <a id="__codelineno-31-***" name="__codelineno-31-***" *** > | URL 1, URL 2, URL 3 |
| warning | 37 | <a id="__codelineno-58-***" name="__codelineno-58-***" *** > | URL 1, URL 2 |
| warning | 37 | <a id="__codelineno-35-***" name="__codelineno-35-***" *** > | URL 1, URL 2 |
| warning | 35 | <a id="__codelineno-75-***" name="__codelineno-75-***" *** > | /core/self-hosting/ |
| warning | 31 | <a id="__codelineno-42-***" name="__codelineno-42-***" *** > | URL 1, URL 2 |
| warning | 31 | <a id="__codelineno-37-***" name="__codelineno-37-***" *** > | URL 1, URL 2 |
| warning | 31 | <a id="__codelineno-59-***" name="__codelineno-59-***" *** > | URL 1, URL 2 |
| warning | 30 | <a id="__codelineno-29-***" name="__codelineno-29-***" *** > | URL 1, URL 2, URL 3 |
| warning | 29 | <a id="__codelineno-34-***" name="__codelineno-34-***" *** > | URL 1, URL 2, URL 3 |
| warning | 29 | <a id="__codelineno-91-***" name="__codelineno-91-***" *** > | /core/self-hosting/ |
| warning | 28 | <a id="__codelineno-68-***" name="__codelineno-68-***" *** > | URL 1, URL 2 |
| warning | 28 | <a id="__codelineno-41-***" name="__codelineno-41-***" *** > | URL 1, URL 2 |
| warning | 27 | <a id="__codelineno-76-***" name="__codelineno-76-***" *** > | /core/self-hosting/ |
| warning | 27 | <a id="__codelineno-83-***" name="__codelineno-83-***" *** > | /core/self-hosting/ |
| warning | 24 | <a id="__codelineno-30-***" name="__codelineno-30-***" *** > | URL 1, URL 2, URL 3 |
| warning | 23 | <a id="__codelineno-73-***" name="__codelineno-73-***" *** > | /core/self-hosting/ |
| warning | 23 | <a id="__codelineno-25-***" name="__codelineno-25-***" *** > | URL 1, URL 2, URL 3, URL 4 |
| warning | 22 | <a id="__codelineno-101-***" name="__codelineno-101-***" *** > | /core/self-hosting/ |
| warning | 20 | <a id="__codelineno-46-***" name="__codelineno-46-***" *** > | URL 1, URL 2 |
| warning | 19 | <a id="__codelineno-80-***" name="__codelineno-80-***" *** > | /core/self-hosting/ |
| warning | 19 | <a id="__codelineno-55-***" name="__codelineno-55-***" *** > | URL 1, URL 2 |
| warning | 18 | <a id="__codelineno-100-***" name="__codelineno-100-***" *** > | /core/self-hosting/ |
| warning | 17 | <a id="__codelineno-61-***" name="__codelineno-61-***" *** > | URL 1, URL 2 |
| warning | 17 | <a id="__codelineno-102-***" name="__codelineno-102-***" *** > | /core/self-hosting/ |
| warning | 17 | <a id="__codelineno-27-***" name="__codelineno-27-***" *** > | URL 1, URL 2, URL 3 |
| warning | 17 | <a id="__codelineno-70-***" name="__codelineno-70-***" *** > | URL 1, URL 2 |
| warning | 17 | <a id="__codelineno-49-***" name="__codelineno-49-***" *** > | URL 1, URL 2 |
| warning | 16 | <a id="__codelineno-92-***" name="__codelineno-92-***" *** > | /core/self-hosting/ |
| warning | 16 | <a id="__codelineno-85-***" name="__codelineno-85-***" *** > | /core/self-hosting/ |
| warning | 15 | <a id="__codelineno-63-***" name="__codelineno-63-***" *** > | URL 1, URL 2 |
| warning | 14 | <a id="__codelineno-69-***" name="__codelineno-69-***" *** > | URL 1, URL 2 |
| warning | 13 | <a id="__codelineno-28-***" name="__codelineno-28-***" *** > | URL 1, URL 2, URL 3 |
| warning | 13 | <a id="__codelineno-54-***" name="__codelineno-54-***" *** > | URL 1, URL 2 |
| warning | 12 | <a id="__codelineno-67-***" name="__codelineno-67-***" *** > | URL 1, URL 2 |
| warning | 12 | <a id="__codelineno-40-***" name="__codelineno-40-***" *** > | URL 1, URL 2 |
| warning | 11 | <button class="copy-*" *** > | URL 1, URL 2 |
| warning | 11 | <a id="__codelineno-53-***" name="__codelineno-53-***" *** > | URL 1, URL 2 |
| warning | 11 | <a id="__codelineno-60-***" name="__codelineno-60-***" *** > | URL 1, URL 2 |
| warning | 11 | <a id="__codelineno-74-***" name="__codelineno-74-***" *** > | /core/self-hosting/ |
| warning | 10 | <a id="__codelineno-39-***" name="__codelineno-39-***" *** > | URL 1, URL 2 |
| warning | 10 | <a id="__codelineno-107-***" name="__codelineno-107-***" *** > | /core/self-hosting/ |
| warning | 10 | <a id="__codelineno-64-***" name="__codelineno-64-***" *** > | URL 1, URL 2 |
| warning | 9 | <a id="__codelineno-56-***" name="__codelineno-56-***" *** > | URL 1, URL 2 |
| You have reached the limit of 100 rows as a protection against very large output or exhausted memory. | |||
| No rows found, please edit your search term. | |||
Missing roles
Found 14 row(s).
| Severity | Occurs | Detail | Affected URLs (max 5) |
|---|---|---|---|
| warning | 113 | <nav> | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 58 | <main id="terminal-mkdocs-main-content"> | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 58 | <nav class="terminal-*"> | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 58 | <aside id="terminal-mkdocs-side-panel"> | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 58 | <header class="terminal-*"> | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 2 | <nav class="nav-*"> | URL 1, URL 2 |
| warning | 1 | <aside class="admin-*"> | /marketplace/admin/ |
| warning | 1 | <header class="admin-*"> | /marketplace/admin/ |
| warning | 1 | <header class="marketplace-*"> | /marketplace/ |
| warning | 1 | <main class="magazine-*"> | /marketplace/ |
| warning | 1 | <footer class="marketplace-*"> | /marketplace/ |
| warning | 1 | <footer class="footer"> | /apps/crawl4ai-assistant/ |
| warning | 1 | <nav class="sidebar-*"> | /marketplace/admin/ |
| warning | 1 | <main class="admin-*"> | /marketplace/admin/ |
| No rows found, please edit your search term. | |||
Missing html lang attribute
No problems found.
Security
| Header | OK | Notice | Warning | Critical | Recommendation |
|---|---|---|---|---|---|
| Strict-Transport-Security | 0 | 0 | 0 | 69 | Strict-Transport-Security header is not set. It enforces secure connections and protects against MITM attacks. |
| Content-Security-Policy | 0 | 0 | 0 | 69 | Content-Security-Policy header is not set. It restricts resources the page can load and prevents XSS attacks. |
| Server | 0 | 0 | 0 | 69 | Server header is set to 'nginx/1.24.0 (Ubuntu)'. It is better not to reveal the technologies used and especially their versions. |
| X-Frame-Options | 0 | 0 | 69 | 0 | X-Frame-Options header is not set. It prevents clickjacking attacks when set to 'deny' or 'sameorigin. |
| X-Content-Type-Options | 0 | 0 | 69 | 0 | X-Content-Type-Options header is not set. It stops MIME type sniffing and mitigates content type attacks. |
| Referrer-Policy | 0 | 0 | 69 | 0 | Referrer-Policy header is not set. It controls referrer header sharing and enhances privacy and security. |
| Feature-Policy | 0 | 0 | 69 | 0 | Feature-Policy header is not set. It allows enabling/disabling browser APIs and features for security. Not important if Permissions-Policy is set. |
| Permissions-Policy | 0 | 0 | 69 | 0 | Permissions-Policy header is not set. It allows enabling/disabling browser APIs and features for security. |
| X-XSS-Protection | 69 | 0 | 0 | 0 |
Security headers
| Severity | Occurs | Detail | Affected URLs (max 5) |
|---|---|---|---|
| critical | 69 | Strict-Transport-Security header is not set. It enforces secure connections and protects against MITM attacks. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| critical | 69 | Content-Security-Policy header is not set. It restricts resources the page can load and prevents XSS attacks. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| critical | 69 | Server header is set to 'nginx/1.24.0 (Ubuntu)'. It is better not to reveal the technologies used and especially their versions. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 69 | X-Frame-Options header is not set. It prevents clickjacking attacks when set to 'deny' or 'sameorigin. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 69 | Permissions-Policy header is not set. It allows enabling/disabling browser APIs and features for security. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 69 | X-Content-Type-Options header is not set. It stops MIME type sniffing and mitigates content type attacks. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 69 | Referrer-Policy header is not set. It controls referrer header sharing and enhances privacy and security. | URL 1, URL 2, URL 3, URL 4, URL 5 |
| warning | 69 | Feature-Policy header is not set. It allows enabling/disabling browser APIs and features for security. Not important if Permissions-Policy is set. | URL 1, URL 2, URL 3, URL 4, URL 5 |
TOP non-unique titles
| Count 🔽 | Title |
|---|---|
| 2 | Browser, Crawler & LLM Config - Crawl4AI Documentation (v0.8.x) |
TOP non-unique descriptions
| Count 🔽 | Description |
|---|---|
| 58 | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper |
| 5 |
SEO metadata
Found 63 row(s).
| URL 🔼 | Indexing | Title | H1 | Description | Keywords |
|---|---|---|---|---|---|
| / | Allowed | Home - Crawl4AI Documentation (v0.8.x) | 🚀🤖 Crawl4AI: Open-Source LLM-Friendly Web Crawler & Scraper | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /CONTRIBUTING/ | Allowed | Contributing Guide - Crawl4AI Documentation (v0.8.x) | Contributing to Crawl4AI | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/adaptive-strategies/ | Allowed | Adaptive Strategies - Crawl4AI Documentation (v0.8.x) | Advanced Adaptive Strategies | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/advanced-features/ | Allowed | Overview - Crawl4AI Documentation (v0.8.x) | Overview of Some Important Advanced Features | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/anti-bot-and-fallback/ | Allowed | Anti-Bot & Fallback - Crawl4AI Documentation (v0.8.x) | Anti-Bot Detection & Fallback | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/crawl-dispatcher/ | Allowed | Crawl Dispatcher - Crawl4AI Documentation (v0.8.x) | Crawl Dispatcher | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/file-downloading/ | Allowed | File Downloading - Crawl4AI Documentation (v0.8.x) | Download Handling in Crawl4AI | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/hooks-auth/ | Allowed | Hooks & Auth - Crawl4AI Documentation (v0.8.x) | Hooks & Auth in AsyncWebCrawler | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/identity-based-crawling/ | Allowed | Identity Based Crawling - Crawl4AI Documentation (v0.8.x) | Preserve Your Identity with Crawl4AI | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/lazy-loading/ | Allowed | Lazy Loading - Crawl4AI Documentation (v0.8.x) | Missing H1 | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/multi-url-crawling/ | Allowed | Multi-URL Crawling - Crawl4AI Documentation (v0.8.x) | Advanced Multi-URL Crawling with Dispatchers | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/network-console-capture/ | Allowed | Network & Console Capture - Crawl4AI Documentation (v0.8.x) | Network Requests & Console Message Capturing | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/pdf-parsing/ | Allowed | PDF Parsing - Crawl4AI Documentation (v0.8.x) | PDF Processing Strategies | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/proxy-security/ | Allowed | Proxy & Security - Crawl4AI Documentation (v0.8.x) | Proxy & Security | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/session-management/ | Allowed | Session Management - Crawl4AI Documentation (v0.8.x) | Session Management | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/ssl-certificate/ | Allowed | SSL Certificate - Crawl4AI Documentation (v0.8.x) | SSLCertificate Reference | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/undetected-browser/ | Allowed | Undetected Browser - Crawl4AI Documentation (v0.8.x) | Undetected Browser Mode | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /advanced/virtual-scroll/ | Allowed | Virtual Scroll - Crawl4AI Documentation (v0.8.x) | Virtual Scroll | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /api/adaptive-crawler/ | Allowed | AdaptiveCrawler - Crawl4AI Documentation (v0.8.x) | AdaptiveCrawler | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /api/arun/ | Allowed | arun() - Crawl4AI Documentation (v0.8.x) | arun() Parameter Guide (New Approach) | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /api/arun_many/ | Allowed | arun_many() - Crawl4AI Documentation (v0.8.x) | arun_many(...) Reference | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /api/async-webcrawler/ | Allowed | AsyncWebCrawler - Crawl4AI Documentation (v0.8.x) | AsyncWebCrawler | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /api/c4a-script-reference/ | Allowed | C4A-Script Reference - Crawl4AI Documentation (v0.8.x) | C4A-Script API Reference | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /api/crawl-result/ | Allowed | CrawlResult - Crawl4AI Documentation (v0.8.x) | CrawlResult Reference | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /api/digest/ | Allowed | digest() - Crawl4AI Documentation (v0.8.x) | digest() | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /api/parameters/ | Allowed | Browser, Crawler & LLM Config - Crawl4AI Documentation (v0.8.x) | 1. BrowserConfig – Controlling the Browser | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /api/strategies/ | Allowed | Strategies - Crawl4AI Documentation (v0.8.x) | Extraction & Chunking Strategies API | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /apps/ | Allowed | Demo Apps - Crawl4AI Documentation (v0.8.x) | 🚀 Crawl4AI Interactive Apps | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /apps/c4a-script/ | Allowed | C4A-Script Interactive Tutorial | Crawl4AI | Missing H1 | ||
| /apps/crawl4ai-assistant/ | Allowed | Crawl4AI Assistant - Chrome Extension for Visual Web Scraping | Crawl4AI Assistant | ||
| /apps/llmtxt/ | Allowed | Crawl4AI LLM Context Builder | Crawl4AI LLM Context Builder | ||
| /blog/ | Allowed | Blog Home - Crawl4AI Documentation (v0.8.x) | Crawl4AI Blog | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /blog/articles/adaptive-crawling-revolution/ | Allowed | Adaptive Crawling: Building Dynamic Knowledge That Grows on Demand - Crawl4AI Documentation (v0.8.x) | Adaptive Crawling: Building Dynamic Knowledge That Grows on Demand | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /blog/articles/llm-context-revolution/ | Allowed | The LLM Context Protocol: Why Your AI Assistant Needs Memory, Reasoning, and Examples - Crawl4AI Documentation (v0.8.x) | The LLM Context Protocol: Why Your AI Assistant Needs Memory, Reasoning, and Examples | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /branding/ | Allowed | Brand Book - Crawl4AI Documentation (v0.8.x) | 🎨 Crawl4AI Brand Book | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/adaptive-crawling/ | Allowed | Adaptive Crawling - Crawl4AI Documentation (v0.8.x) | Adaptive Web Crawling | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/ask-ai/ | Allowed | Ask AI - Crawl4AI Documentation (v0.8.x) | Missing H1 | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/browser-crawler-config/ | Allowed | Browser, Crawler & LLM Config - Crawl4AI Documentation (v0.8.x) | Browser, Crawler & LLM Configuration (Quick Overview) | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/c4a-script/ | Allowed | C4A-Script - Crawl4AI Documentation (v0.8.x) | C4A-Script: Visual Web Automation Made Simple | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/cache-modes/ | Allowed | Cache Modes - Crawl4AI Documentation (v0.8.x) | Crawl4AI Cache System and Migration Guide | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/cli/ | Allowed | Command Line Interface - Crawl4AI Documentation (v0.8.x) | Crawl4AI CLI Guide | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/content-selection/ | Allowed | Content Selection - Crawl4AI Documentation (v0.8.x) | Content Selection | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/crawler-result/ | Allowed | Crawler Result - Crawl4AI Documentation (v0.8.x) | Crawl Result and Output | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/deep-crawling/ | Allowed | Deep Crawling - Crawl4AI Documentation (v0.8.x) | Deep Crawling | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/examples/ | Allowed | Code Examples - Crawl4AI Documentation (v0.8.x) | Code Examples | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/fit-markdown/ | Allowed | Fit Markdown - Crawl4AI Documentation (v0.8.x) | Fit Markdown with Pruning & BM25 | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/installation/ | Allowed | Installation - Crawl4AI Documentation (v0.8.x) | Installation & Setup (2023 Edition) | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/link-media/ | Allowed | Link & Media - Crawl4AI Documentation (v0.8.x) | Link & Media | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/llmtxt/ | Allowed | Llmtxt - Crawl4AI Documentation (v0.8.x) | Missing H1 | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/local-files/ | Allowed | Local Files & Raw HTML - Crawl4AI Documentation (v0.8.x) | Prefix-Based Input Handling in Crawl4AI | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/markdown-generation/ | Allowed | Markdown Generation - Crawl4AI Documentation (v0.8.x) | Markdown Generation Basics | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/page-interaction/ | Allowed | Page Interaction - Crawl4AI Documentation (v0.8.x) | Page Interaction | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/quickstart/ | Allowed | Quick Start - Crawl4AI Documentation (v0.8.x) | Getting Started with Crawl4AI | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/self-hosting/ | Allowed | Self-Hosting Guide - Crawl4AI Documentation (v0.8.x) | Self-Hosting Crawl4AI 🚀 | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/simple-crawling/ | Allowed | Simple Crawling - Crawl4AI Documentation (v0.8.x) | Simple Crawling | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /core/url-seeding/ | Allowed | URL Seeding - Crawl4AI Documentation (v0.8.x) | URL Seeding: The Smart Way to Crawl at Scale | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /extraction/chunking/ | Allowed | Chunking - Crawl4AI Documentation (v0.8.x) | Chunking Strategies | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /extraction/clustring-strategies/ | Allowed | Clustering Strategies - Crawl4AI Documentation (v0.8.x) | Cosine Strategy | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /extraction/llm-strategies/ | Allowed | LLM Strategies - Crawl4AI Documentation (v0.8.x) | Extracting JSON (LLM) | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /extraction/no-llm-strategies/ | Allowed | LLM-Free Strategies - Crawl4AI Documentation (v0.8.x) | Extracting JSON (No LLM) | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| /marketplace/ | Allowed | Marketplace - Crawl4AI | [ Marketplace ] | ||
| /marketplace/admin/ | Allowed | Admin Dashboard - Crawl4AI Marketplace | [ Admin Access ] | ||
| /stats/ | Allowed | Growth - Crawl4AI Documentation (v0.8.x) | Growth | 🚀🤖 Crawl4AI, Open-source LLM-Friendly Web Crawler & Scraper | |
| No rows found, please edit your search term. | |||||
OpenGraph metadata
No URLs with OpenGraph data (og:* or twitter:* meta tags).
Heading structure
Found 63 row(s).
| Heading structure | Count | Errors 🔽 | URL |
|---|---|---|---|
| 36 | 5 | /extraction/no-llm-strategies/ |
| 4 | 4 | /advanced/lazy-loading/ |
| 4 | 4 | /apps/c4a-script/ |
| 20 | 4 | /apps/crawl4ai-assistant/ |
| 24 | 3 | /api/parameters/ |
| 42 | 3 | /branding/ |
| 6 | 3 | /core/local-files/ |
| 4 | 3 | /extraction/chunking/ |
| 9 | 2 | /marketplace/admin/ |
| 10 | 0 | / |
| 8 | 0 | /marketplace/ |
| 7 | 0 | /advanced/file-downloading/ |
| 8 | 0 | /advanced/network-console-capture/ |
| 7 | 0 | /core/simple-crawling/ |
| 60 | 0 | /core/self-hosting/ |
| 44 | 0 | /core/deep-crawling/ |
| 18 | 0 | /core/page-interaction/ |
| 15 | 0 | /extraction/clustring-strategies/ |
| 22 | 0 | /core/crawler-result/ |
| 23 | 0 | /core/markdown-generation/ |
| 18 | 0 | /core/content-selection/ |
| 29 | 0 | /advanced/adaptive-strategies/ |
| 13 | 0 | /blog/ |
| 18 | 0 | /api/arun/ |
| 16 | 0 | /api/strategies/ |
| 6 | 0 | /advanced/hooks-auth/ |
| 16 | 0 | /advanced/advanced-features/ |
| 34 | 0 | /core/c4a-script/ |
| 11 | 0 | /core/installation/ |
| 32 | 0 | /api/crawl-result/ |
| 20 | 0 | /advanced/proxy-security/ |
| 13 | 0 | /core/browser-crawler-config/ |
| 17 | 0 | /core/cli/ |
| 4 | 0 | /advanced/session-management/ |
| 11 | 0 | /api/arun_many/ |
| 20 | 0 | /advanced/multi-url-crawling/ |
| 1 | 0 | /advanced/crawl-dispatcher/ |
| 17 | 0 | /extraction/llm-strategies/ |
| 0 | 0 | /core/ask-ai/ | |
| 5 | 0 | /apps/llmtxt/ |
| 35 | 0 | /core/url-seeding/ |
| 17 | 0 | /core/fit-markdown/ |
| 19 | 0 | /advanced/virtual-scroll/ |
| 16 | 0 | /advanced/ssl-certificate/ |
| 13 | 0 | /api/async-webcrawler/ |
| 35 | 0 | /core/adaptive-crawling/ |
| 21 | 0 | /api/c4a-script-reference/ |
| 14 | 0 | /advanced/anti-bot-and-fallback/ |
| 20 | 0 | /core/link-media/ |
| 9 | 0 | /CONTRIBUTING/ |
| 29 | 0 | /advanced/undetected-browser/ |
| 19 | 0 | /advanced/identity-based-crawling/ |
| 14 | 0 | /core/examples/ |
| 16 | 0 | /apps/ |
| 6 | 0 | /stats/ |
| 14 | 0 | /advanced/pdf-parsing/ |
| 9 | 0 | /core/cache-modes/ |
| 12 | 0 | /core/quickstart/ |
| 10 | 0 | /blog/articles/llm-context-revolution/ |
| 27 | 0 | /blog/articles/adaptive-crawling-revolution/ |
| 19 | 0 | /api/adaptive-crawler/ |
| 0 | 0 | /core/llmtxt/ | |
| 18 | 0 | /api/digest/ |
| No rows found, please edit your search term. | |||
404 URLs
Redirected URLs
| Status | Redirected URL 🔼 | Target URL | Found at URL |
|---|---|---|---|
| 301 | /api/parameters | /api/parameters/ | /extraction/llm-strategies/ |
| 301 | /blog/articles/llm-context-revolution | /blog/articles/llm-context-revolution/ | /apps/llmtxt/ |
Skipped URLs Summary
Found 14 row(s).
| Reason | Domain | Unique URLs 🔽 |
|---|---|---|
| Not allowed host | github.com | 14 |
| Not allowed host | discord.gg | 2 |
| Not allowed host | x.com | 2 |
| Not allowed host | www.nstproxy.com | 1 |
| Not allowed host | docs.litellm.ai | 1 |
| Not allowed host | pypi.org | 1 |
| Not allowed host | trendshift.io | 1 |
| Not allowed host | twitter.com | 1 |
| Not allowed host | www.capsolver.com | 1 |
| Not allowed host | pepy.tech | 1 |
| Not allowed host | forms.gle | 1 |
| Not allowed host | www.linkedin.com | 1 |
| Not allowed host | badge.fury.io | 1 |
| Not allowed host | developer.mozilla.org | 1 |
| No rows found, please edit your search term. | ||
Skipped URLs
Found 29 row(s).
External URLs
29 external URL(s) Found 29 row(s).
TOP fastest URLs
Found 20 row(s).
| Time 🔼 | Status | Fast URL |
|---|---|---|
| 178 ms | 200 | /marketplace/admin/ |
| 178 ms | 200 | /apps/c4a-script/ |
| 178 ms | 200 | /apps/llmtxt/ |
| 178 ms | 200 | /marketplace/ |
| 179 ms | 200 | /core/llmtxt/ |
| 179 ms | 200 | /core/ask-ai/ |
| 179 ms | 200 | /advanced/crawl-dispatcher/ |
| 179 ms | 200 | /advanced/lazy-loading/ |
| 179 ms | 200 | /blog/ |
| 179 ms | 200 | /core/examples/ |
| 179 ms | 200 | /core/installation/ |
| 179 ms | 200 | /advanced/file-downloading/ |
| 179 ms | 200 | /core/cache-modes/ |
| 179 ms | 200 | /CONTRIBUTING/ |
| 179 ms | 200 | /apps/ |
| 179 ms | 200 | /api/arun_many/ |
| 179 ms | 200 | /api/arun/ |
| 179 ms | 200 | /advanced/ssl-certificate/ |
| 179 ms | 200 | /stats/ |
| 179 ms | 200 | /blog/articles/adaptive-crawling-revolution/ |
| No rows found, please edit your search term. | ||
TOP slowest URLs
Found 20 row(s).
| Time 🔽 | Status | Slow URL |
|---|---|---|
| 360 ms | 200 | /core/self-hosting/ |
| 249 ms | 200 | /core/simple-crawling/ |
| 236 ms | 200 | /core/deep-crawling/ |
| 223 ms | 200 | /core/page-interaction/ |
| 202 ms | 200 | /core/crawler-result/ |
| 181 ms | 200 | /core/url-seeding/ |
| 180 ms | 200 | /extraction/no-llm-strategies/ |
| 180 ms | 200 | / |
| 180 ms | 200 | /core/link-media/ |
| 180 ms | 200 | /advanced/multi-url-crawling/ |
| 180 ms | 200 | /api/parameters/ |
| 180 ms | 200 | /core/content-selection/ |
| 180 ms | 200 | /core/markdown-generation/ |
| 180 ms | 200 | /core/browser-crawler-config/ |
| 180 ms | 200 | /core/quickstart/ |
| 179 ms | 200 | /advanced/advanced-features/ |
| 179 ms | 200 | /advanced/pdf-parsing/ |
| 179 ms | 200 | /advanced/identity-based-crawling/ |
| 179 ms | 200 | /api/crawl-result/ |
| 179 ms | 200 | /extraction/clustring-strategies/ |
| No rows found, please edit your search term. | ||
Content types
| Content type | URLs 🔽 | Total size | Total time | Avg time | Status 20x | Status 30x | Status 40x |
|---|---|---|---|---|---|---|---|
| HTML | 69 | 5 MB | 12 s | 184 ms | 63 | 0 | 6 |
| Redirect | 2 | 338 B | 357 ms | 178 ms | 0 | 2 | 0 |
Content types (MIME types)
| Content type | URLs 🔽 | Total size | Total time | Avg time | Status 20x | Status 30x | Status 40x |
|---|---|---|---|---|---|---|---|
| text / html | 71 | 5 MB | 13 s | 184 ms | 63 | 2 | 6 |
Source domains
| Domain | Totals | HTML | Redirect |
|---|---|---|---|
| docs.crawl4ai.com | 71 / 5MB / 13s | 69 / 5MB / 12s | 2 / 338B / 357ms |
HTTP headers
| Header 🔼 | Occurs | Unique | Values preview | Min value | Max value |
|---|---|---|---|---|---|
| Content-Length | 2 | - | [ignored generic values] | 178 B | 178 B |
| Content-Type | 71 | 1 | text / html | ||
| Date | 71 | - | [ignored generic values] | 2026-03-24 | 2026-03-24 |
| Etag | 69 | - | [ignored generic values] | ||
| Last-Modified | 63 | - | [ignored generic values] | 2026-02-24 | 2026-02-24 |
| Location | 2 | 2 | /blog/articles/llm-context-revolution/ (1) / /api/parameters/ (1) | ||
| Server | 71 | 1 | nginx/1.24.0 (Ubuntu) |
HTTP header values
| Header | Occurs | Value |
|---|---|---|
| Content-Type | 71 | text / html |
| Location | 1 | /blog/articles/llm-context-revolution/ |
| Location | 1 | /api/parameters/ |
| Server | 71 | nginx/1.24.0 (Ubuntu) |
HTTP Caching by content type (only from crawlable domains)
| Content type | Cache type | URLs 🔽 | AVG lifetime | MIN lifetime | MAX lifetime |
|---|---|---|---|---|---|
| HTML | ETag + Last-Modified | 63 | - | - | - |
| HTML | ETag | 6 | - | - | - |
| Redirect | No cache headers | 2 | - | - | - |
HTTP Caching by domain
| Domain | Cache type | URLs 🔽 | AVG lifetime | MIN lifetime | MAX lifetime |
|---|---|---|---|---|---|
| docs.crawl4ai.com | ETag + Last-Modified | 63 | - | - | - |
| docs.crawl4ai.com | ETag | 6 | - | - | - |
| docs.crawl4ai.com | No cache headers | 2 | - | - | - |
HTTP Caching by domain and content type
| Domain | Content type | Cache type | URLs 🔽 | AVG lifetime | MIN lifetime | MAX lifetime |
|---|---|---|---|---|---|---|
| docs.crawl4ai.com | HTML | ETag + Last-Modified | 63 | - | - | - |
| docs.crawl4ai.com | HTML | ETag | 6 | - | - | - |
| docs.crawl4ai.com | Redirect | No cache headers | 2 | - | - | - |
DNS info
| DNS resolving tree |
|---|
| docs.crawl4ai.com |
| IPv4: 35.163.245.47 |
| DNS server: 127.0.0.53 |
SSL/TLS info
| Info | Text |
|---|---|
| Issuer | C = US, O = Let's Encrypt, CN = E8 |
| Subject | CN = crawl4ai.com |
| Valid from | Mar 10 12:36:46 2026 GMT (VALID already 14.1 day(s)) |
| Valid to | Jun 8 12:36:45 2026 GMT (VALID still for 75.9 day(s)) |
| Supported protocols | TLSv1.2, TLSv1.3 |
| RAW certificate output | Certificate: Data: Version: 3 (0x2) Serial Number: 06:f2:74:ad:9d:05:b0:68:46:4f:8f:8e:89:00:12:c0:d5:84 Signature Algorithm: ecdsa-with-SHA384 Issuer: C = US, O = Let's Encrypt, CN = E8 Validity Not Before: Mar 10 12:36:46 2026 GMT Not After : Jun 8 12:36:45 2026 GMT Subject: CN = crawl4ai.com Subject Public Key Info: Public Key Algorithm: id-ecPublicKey Public-Key: (256 bit) pub: 04:2a:97:9d:3a:5c:1f:b9:a3:ae:ff:53:f5:37:dc: d7:8a:32:51:8a:0f:b1:19:47:b2:54:47:fd:0c:77: 9d:a0:f1:73:e5:73:f2:67:16:55:ce:7f:bb:6d:64: 8e:61:f3:7a:2c:c4:85:25:08:56:a5:82:ed:c0:c3: 96:9f:28:76:0b ASN1 OID: prime256v1 NIST CURVE: P-256 X509v3 extensions: X509v3 Key Usage: critical Digital Signature X509v3 Extended Key Usage: TLS Web Server Authentication X509v3 Basic Constraints: critical CA:FALSE X509v3 Subject Key Identifier: C2:72:71:3C:DD:F0:5B:A2:8B:21:56:F0:CA:8E:1B:74:B4:78:FB:7C X509v3 Authority Key Identifier: 8F:0D:13:A2:F6:2E:7E:D1:50:6C:33:18:38:5D:59:8E:23:72:91:CA Authority Information Access: CA Issuers - URI:http://e8.i.lencr.org/ X509v3 Subject Alternative Name: DNS:*.crawl4ai.com, DNS:crawl4ai.com X509v3 Certificate Policies: Policy: 2.23.140.1.2.1 X509v3 CRL Distribution Points: Full Name: URI:http://e8.c.lencr.org/5.crl CT Precertificate SCTs: Signed Certificate Timestamp: Version : v1 (0x0) Log ID : 64:11:C4:6C:A4:12:EC:A7:89:1C:A2:02:2E:00:BC:AB: 4F:28:07:D4:1E:35:27:AB:EA:FE:D5:03:C9:7D:CD:F0 Timestamp : Mar 10 13:35:17.025 2026 GMT Extensions: none Signature : ecdsa-with-SHA256 30:46:02:21:00:FC:AC:1A:0A:17:DB:44:E0:BD:24:2E: B1:B3:3E:71:DC:B8:D2:08:54:2A:1A:55:23:9A:44:3E: 10:4F:2E:E4:30:02:21:00:97:CC:45:A7:B7:91:00:B1: 61:A8:5F:EE:D6:B8:E9:F2:1D:2B:A2:2A:EB:03:B7:9C: 62:6E:BF:EA:35:57:BE:77 Signed Certificate Timestamp: Version : v1 (0x0) Log ID : E3:23:8D:F2:8D:A2:88:E0:AA:E0:AC:F0:FA:90:C9:85: F0:B6:BF:F5:D2:A5:27:B0:01:FC:1C:44:58:C4:B6:E8 Timestamp : Mar 10 13:35:17.523 2026 GMT Extensions: 00:00:05:00:35:22:1A:1F Signature : ecdsa-with-SHA256 30:44:02:20:6C:64:1B:8E:B0:AE:C1:92:21:D3:22:72: B3:90:37:D5:4F:72:FE:3B:B4:46:28:C6:D3:8F:AE:11: 71:67:CD:B9:02:20:74:99:7C:D2:21:58:57:29:5B:5B: D4:9E:CB:82:3C:49:6E:BB:61:13:2A:56:70:9C:16:75: 3B:33:D0:5D:DB:47 Signature Algorithm: ecdsa-with-SHA384 Signature Value: 30:66:02:31:00:9b:60:6e:88:9b:f4:21:38:8b:54:a5:a3:52: 11:50:14:53:c6:de:4d:fd:66:e0:34:94:1c:c9:b8:bf:1d:a6: 4c:9f:6c:c5:7e:8d:c6:c9:f2:30:3c:b0:16:d4:7a:a7:63:02: 31:00:f0:99:3e:7d:45:44:d0:92:e5:a2:3c:20:b3:c7:24:f1: f1:18:42:1f:29:27:a1:28:54:9b:44:7a:03:83:20:e5:7e:00: e5:da:14:ef:d9:2a:f7:86:1c:78:b9:92:b6:83 |
| RAW protocols output | === ssl2 === s_client: Unknown option: -ssl2 s_client: Use -help for summary. === ssl3 === s_client: Unknown option: -ssl3 s_client: Use -help for summary. === tls1 === 40770DD795760000:error:0A0000BF:SSL routines:tls_setup_handshake:no protocols available:../ssl/statem/statem_lib.c:104: CONNECTED(00000003) --- no peer certificate available --- No client certificate CA names sent --- SSL handshake has read 0 bytes and written 7 bytes Verification: OK --- New, (NONE), Cipher is (NONE) Secure Renegotiation IS NOT supported Compression: NONE Expansion: NONE No ALPN negotiated Early data was not sent Verify return code: 0 (ok) --- === tls1_1 === 408781F13D7F0000:error:0A0000BF:SSL routines:tls_setup_handshake:no protocols available:../ssl/statem/statem_lib.c:104: CONNECTED(00000003) --- no peer certificate available --- No client certificate CA names sent --- SSL handshake has read 0 bytes and written 7 bytes Verification: OK --- New, (NONE), Cipher is (NONE) Secure Renegotiation IS NOT supported Compression: NONE Expansion: NONE No ALPN negotiated Early data was not sent Verify return code: 0 (ok) --- === tls1_2 === depth=2 C = US, O = Internet Security Research Group, CN = ISRG Root X1 verify return:1 depth=1 C = US, O = Let's Encrypt, CN = E8 verify return:1 depth=0 CN = crawl4ai.com verify return:1 CONNECTED(00000003) --- Certificate chain 0 s:CN = crawl4ai.com i:C = US, O = Let's Encrypt, CN = E8 a:PKEY: id-ecPublicKey, 256 (bit); sigalg: ecdsa-with-SHA384 v:NotBefore: Mar 10 12:36:46 2026 GMT; NotAfter: Jun 8 12:36:45 2026 GMT 1 s:C = US, O = Let's Encrypt, CN = E8 i:C = US, O = Internet Security Research Group, CN = ISRG Root X1 a:PKEY: id-ecPublicKey, 384 (bit); sigalg: RSA-SHA256 v:NotBefore: Mar 13 00:00:00 2024 GMT; NotAfter: Mar 12 23:59:59 2027 GMT --- Server certificate -----BEGIN CERTIFICATE----- MIIDkzCCAxigAwIBAgISBvJ0rZ0FsGhGT4+OiQASwNWEMAoGCCqGSM49BAMDMDIx CzAJBgNVBAYTAlVTMRYwFAYDVQQKEw1MZXQncyBFbmNyeXB0MQswCQYDVQQDEwJF ODAeFw0yNjAzMTAxMjM2NDZaFw0yNjA2MDgxMjM2NDVaMBcxFTATBgNVBAMTDGNy YXdsNGFpLmNvbTBZMBMGByqGSM49AgEGCCqGSM49AwEHA0IABCqXnTpcH7mjrv9T 9Tfc14oyUYoPsRlHslRH/Qx3naDxc+Vz8mcWVc5/u21kjmHzeizEhSUIVqWC7cDD lp8odgujggInMIICIzAOBgNVHQ8BAf8EBAMCB4AwEwYDVR0lBAwwCgYIKwYBBQUH AwEwDAYDVR0TAQH/BAIwADAdBgNVHQ4EFgQUwnJxPN3wW6KLIVbwyo4bdLR4+3ww HwYDVR0jBBgwFoAUjw0TovYuftFQbDMYOF1ZjiNykcowMgYIKwYBBQUHAQEEJjAk MCIGCCsGAQUFBzAChhZodHRwOi8vZTguaS5sZW5jci5vcmcvMCcGA1UdEQQgMB6C DiouY3Jhd2w0YWkuY29tggxjcmF3bDRhaS5jb20wEwYDVR0gBAwwCjAIBgZngQwB AgEwLAYDVR0fBCUwIzAhoB+gHYYbaHR0cDovL2U4LmMubGVuY3Iub3JnLzUuY3Js MIIBDAYKKwYBBAHWeQIEAgSB/QSB+gD4AHcAZBHEbKQS7KeJHKICLgC8q08oB9Qe NSer6v7VA8l9zfAAAAGc1/WCIQAABAMASDBGAiEA/KwaChfbROC9JC6xsz5x3LjS CFQqGlUjmkQ+EE8u5DACIQCXzEWnt5EAsWGoX+7WuOnyHSuiKusDt5xibr/qNVe+ dwB9AOMjjfKNoojgquCs8PqQyYXwtr/10qUnsAH8HERYxLboAAABnNf1hBMACAAA BQA1IhofBAMARjBEAiBsZBuOsK7BkiHTInKzkDfVT3L+O7RGKMbTj64RcWfNuQIg dJl80iFYVylbW9Sey4I8SW67YRMqVnCcFnU7M9Bd20cwCgYIKoZIzj0EAwMDaQAw ZgIxAJtgboib9CE4i1Slo1IRUBRTxt5N/WbgNJQcybi/HaZMn2zFfo3GyfIwPLAW 1HqnYwIxAPCZPn1FRNCS5aI8ILPHJPHxGEIfKSehKFSbRHoDgyDlfgDl2hTv2Sr3 hhx4uZK2gw== -----END CERTIFICATE----- subject=CN = crawl4ai.com issuer=C = US, O = Let's Encrypt, CN = E8 --- No client certificate CA names sent Peer signing digest: SHA256 Peer signature type: ECDSA Server Temp Key: X25519, 253 bits --- SSL handshake has read 2333 bytes and written 307 bytes Verification: OK --- New, TLSv1.2, Cipher is ECDHE-ECDSA-AES256-GCM-SHA384 Server public key is 256 bit Secure Renegotiation IS supported Compression: NONE Expansion: NONE No ALPN negotiated SSL-Session: Protocol : TLSv1.2 Cipher : ECDHE-ECDSA-AES256-GCM-SHA384 Session-ID: DFE1C4DB229AFA8521FB5C4855F36780058A4A3B70715491B6994C0C1C85CDEB Session-ID-ctx: Master-Key: 4EA712BE001A51648BCE5A833718F315518029390411DA10C44E7637F51F0E5C293AA6E74713EDF56624410DCF1B7C46 PSK identity: None PSK identity hint: None SRP username: None Start Time: 1774362816 Timeout : 7200 (sec) Verify return code: 0 (ok) Extended master secret: yes --- DONE === tls1_3 === depth=2 C = US, O = Internet Security Research Group, CN = ISRG Root X1 verify return:1 depth=1 C = US, O = Let's Encrypt, CN = E8 verify return:1 depth=0 CN = crawl4ai.com verify return:1 CONNECTED(00000003) --- Certificate chain 0 s:CN = crawl4ai.com i:C = US, O = Let's Encrypt, CN = E8 a:PKEY: id-ecPublicKey, 256 (bit); sigalg: ecdsa-with-SHA384 v:NotBefore: Mar 10 12:36:46 2026 GMT; NotAfter: Jun 8 12:36:45 2026 GMT 1 s:C = US, O = Let's Encrypt, CN = E8 i:C = US, O = Internet Security Research Group, CN = ISRG Root X1 a:PKEY: id-ecPublicKey, 384 (bit); sigalg: RSA-SHA256 v:NotBefore: Mar 13 00:00:00 2024 GMT; NotAfter: Mar 12 23:59:59 2027 GMT --- Server certificate -----BEGIN CERTIFICATE----- MIIDkzCCAxigAwIBAgISBvJ0rZ0FsGhGT4+OiQASwNWEMAoGCCqGSM49BAMDMDIx CzAJBgNVBAYTAlVTMRYwFAYDVQQKEw1MZXQncyBFbmNyeXB0MQswCQYDVQQDEwJF ODAeFw0yNjAzMTAxMjM2NDZaFw0yNjA2MDgxMjM2NDVaMBcxFTATBgNVBAMTDGNy YXdsNGFpLmNvbTBZMBMGByqGSM49AgEGCCqGSM49AwEHA0IABCqXnTpcH7mjrv9T 9Tfc14oyUYoPsRlHslRH/Qx3naDxc+Vz8mcWVc5/u21kjmHzeizEhSUIVqWC7cDD lp8odgujggInMIICIzAOBgNVHQ8BAf8EBAMCB4AwEwYDVR0lBAwwCgYIKwYBBQUH AwEwDAYDVR0TAQH/BAIwADAdBgNVHQ4EFgQUwnJxPN3wW6KLIVbwyo4bdLR4+3ww HwYDVR0jBBgwFoAUjw0TovYuftFQbDMYOF1ZjiNykcowMgYIKwYBBQUHAQEEJjAk MCIGCCsGAQUFBzAChhZodHRwOi8vZTguaS5sZW5jci5vcmcvMCcGA1UdEQQgMB6C DiouY3Jhd2w0YWkuY29tggxjcmF3bDRhaS5jb20wEwYDVR0gBAwwCjAIBgZngQwB AgEwLAYDVR0fBCUwIzAhoB+gHYYbaHR0cDovL2U4LmMubGVuY3Iub3JnLzUuY3Js MIIBDAYKKwYBBAHWeQIEAgSB/QSB+gD4AHcAZBHEbKQS7KeJHKICLgC8q08oB9Qe NSer6v7VA8l9zfAAAAGc1/WCIQAABAMASDBGAiEA/KwaChfbROC9JC6xsz5x3LjS CFQqGlUjmkQ+EE8u5DACIQCXzEWnt5EAsWGoX+7WuOnyHSuiKusDt5xibr/qNVe+ dwB9AOMjjfKNoojgquCs8PqQyYXwtr/10qUnsAH8HERYxLboAAABnNf1hBMACAAA BQA1IhofBAMARjBEAiBsZBuOsK7BkiHTInKzkDfVT3L+O7RGKMbTj64RcWfNuQIg dJl80iFYVylbW9Sey4I8SW67YRMqVnCcFnU7M9Bd20cwCgYIKoZIzj0EAwMDaQAw ZgIxAJtgboib9CE4i1Slo1IRUBRTxt5N/WbgNJQcybi/HaZMn2zFfo3GyfIwPLAW 1HqnYwIxAPCZPn1FRNCS5aI8ILPHJPHxGEIfKSehKFSbRHoDgyDlfgDl2hTv2Sr3 hhx4uZK2gw== -----END CERTIFICATE----- subject=CN = crawl4ai.com issuer=C = US, O = Let's Encrypt, CN = E8 --- No client certificate CA names sent Peer signing digest: SHA256 Peer signature type: ECDSA Server Temp Key: X25519, 253 bits --- SSL handshake has read 2413 bytes and written 331 bytes Verification: OK --- New, TLSv1.3, Cipher is TLS_AES_256_GCM_SHA384 Server public key is 256 bit Secure Renegotiation IS NOT supported Compression: NONE Expansion: NONE No ALPN negotiated Early data was not sent Verify return code: 0 (ok) --- DONE |
Crawler stats
| Basic stats | |
|---|---|
| Total execution time | 12 s |
| Total URLs | 71 |
| Total size | 5 MB |
| Requests - total time | 13 s |
| Requests - avg time | 185 ms |
| Requests - min time | 179 ms |
| Requests - max time | 360 ms |
| Requests by status | 200: 63 301: 2 404: 6 |
Analysis stats
Found 21 row(s).
| Class::method | Exec time 🔽 | Exec count |
|---|---|---|
| SslTlsAnalyzer::getTLSandSSLCertificateInfo | 3.1 s | 1 |
| AccessibilityAnalyzer::checkMissingAriaLabels | 289 ms | 63 |
| AccessibilityAnalyzer::checkMissingLabels | 265 ms | 63 |
| BestPracticeAnalyzer::checkHeadingStructure | 233 ms | 69 |
| AccessibilityAnalyzer::checkMissingRoles | 225 ms | 63 |
| AccessibilityAnalyzer::checkMissingLang | 199 ms | 63 |
| BestPracticeAnalyzer::checkMaxDOMDepth | 194 ms | 69 |
| BestPracticeAnalyzer::checkNonClickablePhoneNumbers | 73 ms | 69 |
| BestPracticeAnalyzer::checkMissingQuotesOnAttributes | 15 ms | 69 |
| SeoAndOpenGraphAnalyzer::analyzeHeadings | 8 ms | 1 |
| SecurityAnalyzer::checkHtmlSecurity | 8 ms | 69 |
| AccessibilityAnalyzer::checkImageAltAttributes | 5 ms | 63 |
| BestPracticeAnalyzer::checkInlineSvg | 5 ms | 69 |
| SecurityAnalyzer::checkHeaders | 1 ms | 69 |
| SeoAndOpenGraphAnalyzer::analyzeSeo | 0 ms | 1 |
| BestPracticeAnalyzer::checkTitleUniqueness | 0 ms | 1 |
| BestPracticeAnalyzer::checkMetaDescriptionUniqueness | 0 ms | 1 |
| SeoAndOpenGraphAnalyzer::analyzeOpenGraph | 0 ms | 1 |
| BestPracticeAnalyzer::checkBrotliSupport | 0 ms | 1 |
| BestPracticeAnalyzer::checkWebpSupport | 0 ms | 1 |
| BestPracticeAnalyzer::checkAvifSupport | 0 ms | 1 |
| No rows found, please edit your search term. | ||
Content processor stats
Found 12 row(s).
| Class::method | Exec time 🔽 | Exec count |
|---|---|---|
| HtmlProcessor::findUrls | 89 ms | 71 |
| NextJsProcessor::applyContentChangesBeforeUrlParsing | 36 ms | 69 |
| JavaScriptProcessor::findUrls | 34 ms | 69 |
| CssProcessor::findUrls | 3 ms | 69 |
| AstroProcessor::findUrls | 0 ms | 69 |
| AstroProcessor::applyContentChangesBeforeUrlParsing | 0 ms | 69 |
| NextJsProcessor::findUrls | 0 ms | 69 |
| JavaScriptProcessor::applyContentChangesBeforeUrlParsing | 0 ms | 69 |
| HtmlProcessor::applyContentChangesBeforeUrlParsing | 0 ms | 71 |
| SvelteProcessor::findUrls | 0 ms | 69 |
| SvelteProcessor::applyContentChangesBeforeUrlParsing | 0 ms | 69 |
| CssProcessor::applyContentChangesBeforeUrlParsing | 0 ms | 69 |
| No rows found, please edit your search term. | ||
Crawler info
| Version | 2.1.0.20260317 |
|---|---|
| Executed At | 2026-03-24 14:33:25 |
| Command | siteone-crawler --url=https://docs.crawl4ai.com --markdown-export-dir=/tmp/siteone-crawl4ai --markdown-exclude-selector=header,footer,nav,.sidebar,.menu,.breadcrumb,script,style --timeout=30 --workers=5 --disable-javascript --disable-styles --disable-fonts --disable-images --disable-files --no-color --hide-progress-bar --output=text |
| Hostname | ubuntu-8gb-hel1-1 |
| User-Agent | Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/26.0.0.0 Safari/537.36 siteone-crawler/2.1.0.20260317 |