MarketCheck collects comprehensive heavy equipment listings and inventory data from dealer websites. This includes key information such as make, model, year, price, mileage, VIN, vehicle features, and dealer contact details. We capture data for both new and used heavy equipment, ensuring broad and accurate coverage of the available market inventory.
Market | Sources |
---|---|
United States | Dealer websites |
MarketCheck began collecting heavy equipment inventory data in 2020. Since then, we’ve built one of the most comprehensive and continuously updated heavy equipment datasets in the industry.
Market | Websites Crawled Daily | Daily Listings Volume | Historical Dataset Size |
---|---|---|---|
United States | 3100+ | ~250K | ~10 million listings* |
Data Points: Each listing contains approximately 45 data points, ensuring detailed vehicle information across all markets.
MarketCheck uses Autobot, our proprietary crawling platform developed and refined over 5 years of continuous operation.
MarketCheck indexes and classifies websites from the internet to add to its crawling platform. The websites that are discovered containing heavy equipment inventory data are then added to the crawling platform for regular crawl.
Autobot employs a systematic approach with 24/7 crawling operations monitored by a dedicated operations team for uptime.
Website Type | Crawling Frequency |
---|---|
Dealer websites | Every 48 hours |
Crawling Focus: MarketCheck only crawls inventory pages from sites - other pages are skipped. We do not crawl the full website.
Phase 1: Search Result Pages (SRPs)
Phase 2: Vehicle Detail Pages (VDPs) VDP crawling follows specific logic based on listing status:
Listing Status | VDP Crawl Decision |
---|---|
New listing (first time seen) | VDP crawled same day |
Existing listing (no changes in SRP) | VDP skipped, unless 14+ days since last VDP crawl |
Existing listing (changes detected) | VDP crawled same day (price, mileage, or other attribute changes) |
MarketCheck uses rules-based extraction employing XPath expressions, regular expressions (regex), and JSON extraction over automated natural language extraction to achieve highest accuracy.
Processing Pipeline:
The MarketCheck crawling platform access to all webpages during both search results page crawls and vehicle detail page crawls.
Issue Resolution:
After crawling and extraction phases are completed, the parsing process ensures data quality and consistency.
Response Timeline:
When quality issues are identified, alerts are raised and the operations team reviews and resolves them on priority within 24-48 hours.
Access Method | Description |
---|---|
Daily Data Feed Dumps | Complete batch data delivery |
API Access | Real-time programmatic data access |
For detailed information about each access method, visit their respective documentation pages.
This data gathering operation represents 5 years of consistent, high-accuracy recreational vehicle data collection, providing customers with detailed vehicle inventory intelligence.