Arquivo Web Crawler

Good Bot | Verified | Archiver | BOT

Operated by Arquivo

What is Arquivo Web Crawler?

Generated from Cloudflare, operator docs, and community data

Arquivo Web Crawler is a web crawler operated by Arquivo. It archives the Portuguese web.

Helpful — Verified, safe crawler. Respects robots.txt and provides operator documentation.

Details

Bot Name: Arquivo Web Crawler
Slug: arquivo
Kind: BOT
Verification: Verified by Cloudflare
Category: Archiver
Operator: Arquivo

User-Agent Patterns

Arquivo-web-crawler

Full User-Agent Strings

Arquivo-web-crawler (compatible; heritrix/3.4.0-20200304 +https://arquivo.pt/faq-crawling)
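
As a minimal sketch, the full user-agent string above can be split into its parts with a regular expression. The pattern is modeled only on the single string listed here, so other crawler versions may not match it; the function names `parse_arquivo_ua` and `claims_arquivo` are hypothetical, and the case-insensitive substring check is an assumption about how loosely you want to match.

```python
import re

# Modeled on the one full user-agent string listed in this profile.
# Assumption: other releases of the crawler may format the string differently.
UA_RE = re.compile(
    r"^(?P<product>Arquivo-web-crawler)\s+"
    r"\(compatible;\s+heritrix/(?P<heritrix>\S+)\s+\+(?P<info_url>\S+)\)$"
)


def parse_arquivo_ua(user_agent: str):
    """Return product, Heritrix version, and info URL, or None if no match."""
    match = UA_RE.match(user_agent.strip())
    return match.groupdict() if match else None


def claims_arquivo(user_agent: str) -> bool:
    """Loose check: the pattern token appears anywhere, ignoring case."""
    return "arquivo-web-crawler" in user_agent.lower()
```

Either function only tells you what a request claims to be, since a client can send any user-agent string it likes.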

Main Use Cases

Web crawling, Data collection

Why It Crawls Your Site

Arquivo Web Crawler crawls websites to archive and preserve web content. It is operated by Arquivo as part of their web archival infrastructure. If you see this bot in your server logs, it is a verified crawler and generally safe.

How to Block / Whitelist

Block Arquivo Web Crawler

Add a Disallow rule for Arquivo-web-crawler in your robots.txt file. You can also block at the server level using your web server configuration or CDN firewall rules to filter requests matching the user-agent string.

# Block Arquivo Web Crawler from crawling your entire site
User-agent: Arquivo-web-crawler
Disallow: /

# Allow Arquivo Web Crawler full access (use instead of the block rule, not alongside it)
User-agent: Arquivo-web-crawler
Allow: /
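
For the server-level blocking mentioned above, one possible approach in a Python web application is a small WSGI middleware that rejects requests whose user-agent contains the crawler's pattern. This is an illustrative sketch, not a configuration recommended by Arquivo or Cloudflare; `BlockByUserAgent` and `hello_app` are hypothetical names, and the 403 response is an assumed choice.

```python
BLOCKED_TOKEN = "arquivo-web-crawler"  # lowercase form of the user-agent pattern


def hello_app(environ, start_response):
    """Stand-in for your real WSGI application (hypothetical)."""
    start_response("200 OK", [("Content-Type", "text/plain")])
    return [b"hello\n"]


class BlockByUserAgent:
    """WSGI middleware that answers 403 when the user-agent contains a token."""

    def __init__(self, app, token=BLOCKED_TOKEN):
        self.app = app
        self.token = token.lower()

    def __call__(self, environ, start_response):
        user_agent = environ.get("HTTP_USER_AGENT", "")
        if self.token in user_agent.lower():
            body = b"Forbidden\n"
            headers = [
                ("Content-Type", "text/plain"),
                ("Content-Length", str(len(body))),
            ]
            start_response("403 Forbidden", headers)
            return [body]
        # Every other client is passed through to the wrapped application.
        return self.app(environ, start_response)


application = BlockByUserAgent(hello_app)
```

Because the crawler is described as respecting robots.txt, the Disallow rule is usually sufficient; a filter like this is a fallback that also catches clients merely imitating the user-agent.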

Whitelist Arquivo Web Crawler

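Ensure your robots.txt allows Arquivo-web-crawler. To check that requests are genuine, compare the user-agent string against the pattern listed above and consult Arquivo's documentation at https://arquivo.pt/faq-crawling. Keep in mind that a user-agent string alone can be spoofed.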
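
Before deploying either robots.txt rule from this section, you can check how a parser reads it. The sketch below uses Python's standard `urllib.robotparser`; it shows how that parser interprets the rules, which is an approximation of, not a guarantee about, how the crawler itself interprets them. The helper name `allowed` and the test path `/page` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

BLOCK_RULES = """\
User-agent: Arquivo-web-crawler
Disallow: /
"""

ALLOW_RULES = """\
User-agent: Arquivo-web-crawler
Allow: /
"""


def allowed(robots_txt: str, agent: str, path: str = "/") -> bool:
    """Parse robots.txt text and report whether `agent` may fetch `path`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    print(allowed(BLOCK_RULES, "Arquivo-web-crawler", "/page"))
    print(allowed(ALLOW_RULES, "Arquivo-web-crawler", "/page"))
    print(allowed(BLOCK_RULES, "SomeOtherBot", "/page"))
```

With the block rule the crawler's token is denied while unrelated agents are unaffected, since the rule group names only Arquivo-web-crawler.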

Traffic & Trends

Cloudflare Radar Traffic

No Cloudflare traffic data available

Links & References

Data Sources

This profile was compiled from the following sources:

Cloudflare Radar

Last updated: May 26, 2026