Technical documentation of the Meta Tag Generator: PHP-based URL crawler, multicrawl with Server-Sent Events, multi-provider AI architecture with rule-based fallback, and outputs for Title, Description, Keywords, Open Graph, Twitter Card, JSON-LD and Robots.
v1.0 · 7 AI providers · Single URL + Multicrawl · PHP · Vanilla JS · Open Source
The Meta Tag Generator crawls any URL server-side and extracts the full page content: H1 and H2 headings, paragraphs, existing meta tags, JSON-LD blocks and body text. This content is passed to an AI provider (or a rule-based fallback), which generates optimised meta tags based on what the page is actually about.
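As a rough illustration of the extraction step, the sketch below pulls headings, paragraphs, existing meta tags and JSON-LD blocks out of an HTML string with PHP's built-in DOM extension. The function name extractPageContent and the shape of the returned array are assumptions made for this example; the tool's actual crawler may be organised differently.

```php
<?php
// Hypothetical helper: extract the fields named above from an HTML string.
// Name and return shape are assumptions, not the tool's actual code.
function extractPageContent(string $html): array
{
    $doc = new DOMDocument();
    // Real-world markup is rarely well-formed; collect parser warnings
    // internally instead of emitting them, then restore the old setting.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);

    // Collect the trimmed, non-empty text of every node matching a query.
    $texts = function (string $query) use ($xpath): array {
        $out = [];
        foreach ($xpath->query($query) as $node) {
            $text = trim($node->textContent);
            if ($text !== '') {
                $out[] = $text;
            }
        }
        return $out;
    };

    // Existing meta tags, keyed by their name or property attribute.
    $meta = [];
    foreach ($xpath->query('//meta[@content]') as $tag) {
        $key = $tag->getAttribute('name') ?: $tag->getAttribute('property');
        if ($key !== '') {
            $meta[strtolower($key)] = $tag->getAttribute('content');
        }
    }

    return [
        'h1'         => $texts('//h1'),
        'h2'         => $texts('//h2'),
        'paragraphs' => $texts('//p'),
        'meta'       => $meta,
        'jsonld'     => $texts('//script[@type="application/ld+json"]'),
    ];
}
```

The resulting array is the kind of structure that could then be handed to an AI provider or to the rule-based fallback.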
Version 1.0 introduces Multicrawl mode: the tool automatically discovers all URLs of a domain via sitemap.xml (with robots.txt and link-extraction fallback), then processes every page sequentially with live progress updates via Server-Sent Events. This makes it possible to generate meta tags for an entire website in one run.
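The Multicrawl flow described above can be sketched in two parts: a discovery step that tries each URL source in order, and a sequential loop that emits one Server-Sent Events frame per processed page. All function names, event names (progress, complete) and payload fields here are hypothetical, and the discovery sources are passed in as callables rather than implemented.

```php
<?php
// Try each discovery source in order (for example sitemap.xml, robots.txt,
// link extraction) and use the first one that yields any URLs.
// Each source is a hypothetical callable returning an array of URLs.
function discoverUrls(array $sources, int $maxUrls = 500): array
{
    foreach ($sources as $source) {
        $urls = array_values(array_unique($source()));
        if ($urls !== []) {
            return array_slice($urls, 0, $maxUrls);
        }
    }
    return [];
}

// Format one SSE frame: an "event:" line, a "data:" line carrying a JSON
// payload, and the blank line that terminates the frame.
function sseFrame(string $event, array $payload): string
{
    return "event: {$event}\n"
         . 'data: ' . json_encode($payload) . "\n\n";
}

// Process pages one after another. $processPage stands in for the real
// crawl-and-generate step; $emit receives each finished SSE frame.
function runMulticrawl(array $urls, callable $processPage, callable $emit): void
{
    $total = count($urls);
    foreach (array_values($urls) as $i => $url) {
        $result = $processPage($url);
        $emit(sseFrame('progress', [
            'done'   => $i + 1,
            'total'  => $total,
            'url'    => $url,
            'result' => $result,
        ]));
    }
    $emit(sseFrame('complete', ['total' => $total]));
}
```

In an actual endpoint, the response would be sent with a Content-Type of text/event-stream, and $emit would echo each frame and flush the output buffer so the browser's EventSource receives it immediately.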
The AI provider is fully swappable via a single config line – from no AI (free, rule-based) to Gemini, Claude, GPT-4o, Perplexity, Grok or any OpenAI-compatible endpoint.
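To make the idea of a single config line concrete, here is a minimal sketch. The constant names AI_PROVIDER and PROVIDERS, the lowercase provider keys and the behaviour of falling back to the rule-based mode for unknown values are all assumptions for illustration; the tool's real configuration may use different names.

```php
<?php
// Assumed config line: changing this one value would swap the provider.
const AI_PROVIDER = 'none';

// Provider keys follow the "Tool scope" list; the lowercase spelling and
// the mapping of "OwnAI" to an OpenAI-compatible endpoint are assumptions.
const PROVIDERS = [
    'none'       => 'Rule-based (no AI)',
    'anthropic'  => 'Claude',
    'openai'     => 'GPT-4o',
    'google'     => 'Gemini',
    'perplexity' => 'Perplexity',
    'grok'       => 'Grok',
    'ownai'      => 'OpenAI-compatible endpoint',
];

// Normalise the configured value and fall back to the rule-based mode
// when the value is not a known provider key.
function resolveProvider(string $configured): string
{
    $key = strtolower(trim($configured));
    return array_key_exists($key, PROVIDERS) ? $key : 'none';
}
```

With this shape, resolveProvider(AI_PROVIDER) returns the key the rest of the code would dispatch on, so no other file needs to change when the provider does.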
Tool scope
Modes: Single URL or Multicrawl (full domain)
Sitemap discovery: sitemap.xml → robots.txt → link extraction fallback
Max URLs per run: 500 (configurable)
Live updates: Server-Sent Events (SSE) during multicrawl
Retry logic: Auto-retry on 503/429 (3 attempts, exponential delay)
AI providers: 7 (none/rule-based, Anthropic, OpenAI, Google, Perplexity, Grok, OwnAI)
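The retry behaviour listed above (auto-retry on 503/429, 3 attempts, exponential delay) might look roughly like the following. This sketch assumes that "3 attempts" means three tries in total and that the delay starts at one second and doubles each time; neither detail is confirmed by the source. The $request callable and its return shape are hypothetical.

```php
<?php
// Retry a request on HTTP 503 or 429, up to $maxAttempts tries in total.
// $request is a hypothetical callable returning ['status' => int, 'body' => string].
// $sleep is injectable so the delays can be observed without waiting.
function requestWithRetry(
    callable $request,
    int $maxAttempts = 3,
    float $baseDelay = 1.0,
    ?callable $sleep = null
): array {
    $sleep = $sleep ?? function (float $seconds): void {
        usleep((int) round($seconds * 1000000));
    };

    $response = ['status' => 0, 'body' => ''];
    for ($attempt = 1; $attempt <= $maxAttempts; $attempt++) {
        $response = $request();
        $retryable = in_array($response['status'], [429, 503], true);
        // Stop on success, on a non-retryable status, or after the last try.
        if (!$retryable || $attempt === $maxAttempts) {
            break;
        }
        // Exponential delay: base, 2 x base, 4 x base, ...
        $sleep($baseDelay * (2 ** ($attempt - 1)));
    }
    return $response;
}
```

If every attempt returns 503 or 429, the function returns the last response unchanged, leaving it to the caller to decide whether to skip the page or switch to the rule-based fallback.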