# www.weatherbug.com - AI Crawler Index ## Quick Stats - Total Pages: 9 - Last Updated: 2026-01-31T22:46:27.393Z - Format: HTML + Markdown + JSON-LD - Optimized for: GPT, Claude, Gemini, Llama, and other LLMs - Original Source: https://www.weatherbug.com ## Content Hierarchy ### Folder: air-quality/ ### Folder: air-quality/adra-west-bengal-in/ - Today's Air Quality in Adra, West Bengal, IN | Report & Map | WeatherBug: air-quality/adra-west-bengal-in/index.html ### Folder: air-quality/irtyshsk-pavlodar-kz/ - Today's Air Quality in Irtyshsk, Pavlodar, KZ | Report & Map | WeatherBug: air-quality/irtyshsk-pavlodar-kz/index.html ### Folder: air-quality/south-worcester-ma-01610/ - Today's Air Quality in South Worcester, MA | Report & Map | WeatherBug: air-quality/south-worcester-ma-01610/index.html ### Folder: alerts/ ### Folder: alerts/dublin-oh-43016/ - Weather Alerts For Dublin, OH - Severe Weather Updates & Map | WeatherBug: alerts/dublin-oh-43016/index.html ### Folder: alerts/tri-cities-tn-37617/ - Weather Alerts For Tri-Cities, TN - Severe Weather Updates & Map | WeatherBug: alerts/tri-cities-tn-37617/index.html ### Folder: news/ ### Folder: news/the-november-witches-of-the-great-lakes/ - The November Witches Of The Great Lakes | WeatherBug | WeatherBug: news/the-november-witches-of-the-great-lakes/index.html ### Folder: weather-camera/ ### Folder: weather-camera/south-florida-fl-33021/ - South Florida, FL Live Weather Cameras & Webcams | WeatherBug: weather-camera/south-florida-fl-33021/index.html ### Folder: weather-forecast/ ### Folder: weather-forecast/hourly/ - Local and National Hourly Weather Forecasts | WeatherBug: weather-forecast/hourly/index.html - Local & National Weather Forecasts - Weather Radar & News: site-root.html ## Access Patterns Each page is available in multiple formats: - HTML: [page].html (includes semantic navigation and structured data) - Markdown: [page].md (content only, ideal for text processing) ## Machine-Readable Formats - Sitemap: /sitemap.xml (full page listing with lastmod dates) - Robots: /robots.txt (crawler directives) - This file: /llms.txt (human and AI readable index) - Index: /index.html (interactive hierarchical navigation) ## Content Structure All processed pages include: - Semantic HTML5 markup (article, section, nav, header) - JSON-LD structured data (WebPage schema) - Open Graph metadata - Clean, script-free content - Preserved original metadata (title, author, date, description) - Hierarchical navigation context ## Recommended for - Training data collection - Documentation analysis - Content understanding - Semantic search indexing - Knowledge graph construction - Automated summarization - Cross-reference analysis