MarketCheck provides comprehensive automotive inventory data through systematic collection from dealer websites, auction sites, and private party sellers across multiple markets. This guide explains our data collection methodology, coverage, and quality assurance processes.
MarketCheck collects vehicle listings and inventory data from dealer websites, auction sites, and for-sale-by-owner (FSBO) platforms.
Market | Sources |
---|---|
United States | Dealer websites, auction sites, private party sellers |
Canada | Dealer websites, private party sellers |
United Kingdom | Dealer websites, private party sellers |
MarketCheck has been gathering car inventory data since 2015, establishing one of the most complete automotive datasets in the industry.
Platform Evolution
2015 ──── US Dealer Websites
│
2017 ──── US Private Party Sellers
│
2018 ──── Canadian Dealer Websites
│
2018 ──── US Auction Sites
│
2022 ──── UK Dealer Websites
│
2023 ──── Canadian Private Party Sellers
│
2024 ──── UK Private Party Sellers
Market | Websites Crawled Daily | Daily Listings Volume | Historical Dataset Size |
---|---|---|---|
United States | 80,000+ | ~15 million | ~5 billion listings* |
Canada | 8,200+ | ~1 million | |
United Kingdom | 10,000+ | ~600,000 | ~65 million |
Data Points: Each listing contains approximately 110 data points, ensuring detailed vehicle information across all markets.
MarketCheck uses Autobot, our proprietary crawling platform developed and refined over 10 years of continuous operation.
MarketCheck discovers, indexes, and classifies websites across the internet. Sites found to contain car inventory data are added to the crawling platform and crawled on a regular schedule.
Autobot employs a systematic approach with 24/7 crawling operations monitored by a dedicated operations team for uptime.
Website Type | Crawling Frequency |
---|---|
Dealer websites (all countries) | Daily |
Auction and private party sites | Every 48 hours |
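As a rough illustration of the frequencies in the table above, the sketch below checks whether a site is due for another crawl. The site-type keys, the function name, and the reading of "Daily" as a 24-hour interval are assumptions made for this example, not a description of Autobot's scheduler.

```python
from datetime import datetime, timedelta

# Intervals follow the table above; the keys are hypothetical labels.
CRAWL_INTERVALS = {
    "dealer": timedelta(hours=24),         # "Daily", assumed to mean 24 hours
    "auction": timedelta(hours=48),
    "private_party": timedelta(hours=48),
}


def is_crawl_due(site_type: str, last_crawled: datetime, now: datetime) -> bool:
    """Return True once the crawl interval for this site type has elapsed."""
    return now - last_crawled >= CRAWL_INTERVALS[site_type]
```

A dealer site last crawled 24 hours ago would be due, while an auction site crawled at the same time would wait another day.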
Crawling Focus: MarketCheck crawls only the inventory pages of each site and skips all other pages. It does not crawl the full website.
Phase 1: Search Result Pages (SRPs)
Phase 2: Vehicle Detail Pages (VDPs)
VDP crawling follows specific logic based on listing status:
Listing Status | VDP Crawl Decision |
---|---|
New listing (first time seen) | VDP crawled same day |
Existing listing (no changes in SRP) | VDP skipped, unless 14+ days since last VDP crawl |
Existing listing (changes detected) | VDP crawled same day (price, mileage, or other attribute changes) |
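The decision table above can be sketched as a small function. This is a minimal illustration under stated assumptions: `ListingState`, `should_crawl_vdp`, and the idea of comparing SRP attribute snapshots as dictionaries are hypothetical names and choices for this example. Only the three rules and the 14-day threshold come from the table.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional

VDP_REFRESH_DAYS = 14  # from the table: re-crawl once 14+ days have passed


@dataclass
class ListingState:
    """Hypothetical record of what was last seen for a listing."""
    srp_snapshot: dict     # attributes seen on the SRP (price, mileage, ...)
    last_vdp_crawl: date   # date of the most recent VDP crawl


def should_crawl_vdp(previous: Optional[ListingState],
                     srp_snapshot: dict,
                     today: date) -> bool:
    """Decide whether the VDP should be crawled today."""
    if previous is None:
        return True  # new listing, first time seen
    if srp_snapshot != previous.srp_snapshot:
        return True  # price, mileage, or another attribute changed
    # Unchanged listing: skip unless the last VDP crawl is 14+ days old.
    return (today - previous.last_vdp_crawl).days >= VDP_REFRESH_DAYS
```

Comparing whole snapshots keeps the sketch short; a real system might compare only selected attributes or a hash of them.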
MarketCheck favors rules-based extraction, using XPath expressions, regular expressions (regex), and JSON extraction, over automated natural language extraction in order to achieve the highest accuracy.
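To make the three rule types concrete, here is a small sketch that applies one XPath rule, one regex rule, and one JSON rule to a page fragment. The fragment, the class names, the placeholder VIN, and `extract_listing` are all invented for illustration and do not reflect MarketCheck's actual rules. The sketch uses the Python standard library's limited XPath support, which requires well-formed markup; real dealer pages would likely need a more tolerant HTML parser.

```python
import json
import re
import xml.etree.ElementTree as ET

# Hypothetical, well-formed page fragment used only for this example.
PAGE = """<html><body>
<h1 class="title">2019 Honda Civic LX</h1>
<span class="price">$18,495</span>
<p class="specs">Mileage: 42,310 mi</p>
<script type="application/ld+json">{"vehicleIdentificationNumber": "TESTVIN0000000001"}</script>
</body></html>"""


def extract_listing(page: str) -> dict:
    """Apply one rule of each kind and return the extracted fields."""
    root = ET.fromstring(page)

    # XPath rule: locate the price element by its class attribute.
    price_text = root.findtext(".//span[@class='price']")

    # Regex rule: pull the mileage figure out of free text.
    specs_text = root.findtext(".//p[@class='specs']")
    mileage = re.search(r"Mileage:\s*([\d,]+)", specs_text)

    # JSON rule: read structured data embedded in a script tag.
    payload = json.loads(
        root.findtext(".//script[@type='application/ld+json']")
    )

    return {
        "price": int(re.sub(r"[^\d]", "", price_text)),
        "miles": int(mileage.group(1).replace(",", "")),
        "vin": payload["vehicleIdentificationNumber"],
    }
```

Because each field is tied to an explicit rule, a failed extraction points to a specific rule that needs updating, which is one reason a rules-based approach can be easier to audit.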
Processing Pipeline:
MarketCheck employs continuous operational monitoring of its crawling platform to ensure complete inventory coverage. During both search result page crawls and vehicle detail page crawls, the system ensures access to all webpages through necessary means so that no inventory listings are lost.
Monitoring and Alerts:
When connectivity issues are detected, whether a website is completely or only partially inaccessible, alerts are raised immediately and sent to the operations team, which monitors crawls 24/7.
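One way to picture the distinction between complete and partial access problems is the sketch below. It assumes, purely for illustration, that a crawl run knows how many pages it expected to fetch. `classify_access`, `build_alert`, and the alert fields are hypothetical and are not part of any documented MarketCheck interface.

```python
from typing import Optional


def classify_access(expected_pages: int, fetched_pages: int) -> str:
    """Label a crawl run as 'ok', 'partial', or 'complete' loss of access."""
    if expected_pages <= 0:
        raise ValueError("expected_pages must be positive")
    if fetched_pages == 0:
        return "complete"   # nothing could be fetched
    if fetched_pages < expected_pages:
        return "partial"    # some pages were unreachable
    return "ok"


def build_alert(site: str, expected_pages: int,
                fetched_pages: int) -> Optional[dict]:
    """Return an alert payload for the operations team, or None if healthy."""
    status = classify_access(expected_pages, fetched_pages)
    if status == "ok":
        return None
    return {
        "site": site,
        "status": status,
        "missing_pages": expected_pages - fetched_pages,
    }
```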
Issue Resolution:
If a website is temporarily down or its status is uncertain, MarketCheck continues monitoring it and probing for uptime for the foreseeable future.
After crawling and extraction phases are completed, the parsing process ensures data quality and consistency. The operations team conducts multiple daily reviews of crawled pages and extracted data from previous windows, verifying that coverage and quality of critical data points remain consistent and meet standards.
Quality Assurance Process:
Response Timeline:
When quality issues are identified, alerts are raised and the operations team prioritizes, reviews, and resolves them within 24-48 hours.
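A coverage review of critical data points against a previous window could look roughly like the following. This is a sketch under assumptions: the function names, the definition of "filled" as neither missing nor empty, and the 5% drop threshold are illustrative choices, not MarketCheck's actual checks.

```python
def fill_rate(listings: list[dict], field: str) -> float:
    """Share of listings in which the field is present and non-empty."""
    if not listings:
        return 0.0
    filled = sum(1 for item in listings if item.get(field) not in (None, ""))
    return filled / len(listings)


def coverage_regressions(previous: list[dict],
                         current: list[dict],
                         fields: list[str],
                         max_drop: float = 0.05) -> list[str]:
    """Return the fields whose fill rate fell by more than max_drop."""
    return [
        field for field in fields
        if fill_rate(previous, field) - fill_rate(current, field) > max_drop
    ]
```

Any field returned by `coverage_regressions` would be a candidate for an alert, for example when a site redesign silently breaks the rule that extracts prices.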
This consistent operation has maintained high-accuracy automotive data collection for 10 years, continuously strengthening our extensive dataset.
Access Method | Description |
---|---|
Daily Data Feed Dumps | Complete batch data delivery |
API Access | Real-time programmatic data access |
For detailed information about each access method, visit their respective documentation pages.
This data gathering operation represents 10 years of consistent, high-accuracy automotive data collection, providing customers with detailed vehicle inventory intelligence.