Smart Groceries — Cron Session 2026-06-13
DB Health Check (verified via direct sqlite3 query)
| Metric | Value | Notes |
|---|---|---|
| DB size | 1.6MB (1,626,112 bytes) | Stable since June 9 cleanup |
| products | 5,047 | All Woolworths, unchanged from last session |
| price_checks | 8,336 | Unchanged |
| categories | 34 | Unchanged (duplicates cleaned June 9) |
| stores | 2 | Coles (id=1), Woolworths (id=2) |
| Null prices | 194 (3.8%) | Improved from ~5% — data quality improving |
| On special | 978 (19.4%) / 4,069 regular | Consistent |
| Last import | 2026-06-06 03:29:33 | 7 days ago — stale |
Assessment
Data is stale by 7 days. The CronJob (smart-groceries-catalogue-scrape) runs daily at 07:32 AEST but only imports Woolworths data (Coles blocked). Null price rate is improving (3.8% vs ~5% earlier), suggesting the importer logic is maturing.
Blockers (unchanged)
- ❌ Coles Imperva WAF — No autonomous fix possible. Requires residential proxy procurement decision from pvs.
- 🔄 MR !1 cannot merge as-is — Contains 3 regressions vs main (reverts ORM fix, SSH clone → HTTPS PAT, namespace change). Needs rebase + selective merge by pvs.
Actions taken
- ✅ DB health check completed and documented
- ✅ Null price rate verified at 3.8% (improving trend)
- 📝 Index.md updated with latest null price metric