Generates a domains.json file for each domain with metadata for the site and it's status - this is needed because quite a few of them are 404 or timeouts.
Each domain is stored as flat-files:
- https://indie-stats.com/domain to claim or mark as excluded your domain.
- https://indie-stats.com/domain?domain=bear.im to view domain details
- Domain owners can login and claim and/or exclude their domains from being processed
- Crawl IndieWeb domains and store
- mf2 data
- html content
- request and response headers
- Maintain metadata for domains showing their current status
- Domain list is seeded from IRC-people
Storing request and response headers
For each domain crawled the domain, timestamp and data will be passed to a master "cruncher" that will then loop thru a list of stat generating apps. The resulting json blob from this generating app will be added along with namespace and timestamp to the stat history for the domain. Stat items to calculate:
- have a header for auth
- use indieauth as their auth item
- have a header for webmention
- have a header for micropub
- have a h-card
- have a h-entry
Add an endpoint to allow for a call to be made for a domain and a date range and the response will be the json blob of stats.