wrongmove

Author	SHA1	Message	Date
Viktor Barzin	b9f576ae2b	Stream-process listings as IDs arrive via asyncio.Queue Replace the sequential fetch-all-then-process pipeline with a streaming architecture where listing processing starts as soon as IDs become available from each subquery. A producer task fetches pages and enqueues new IDs (filtered inline against DB), while 20 consumer workers process listings concurrently from the queue. - Add ListingRepository.get_listing_ids() for fast ID-only projection - Refactor listing_tasks.py: remove get_ids_to_process/dump_listings_and_monitor, replace with unified producer/worker/monitor pipeline - Apply same pattern to CLI path in listing_fetcher.py - Remove 'filtering' phase from frontend, show combined fetch+process metrics - Add fetching_done flag to TaskResult for phase transition tracking	2026-02-06 23:43:54 +00:00
Viktor Barzin	b4837e1603	Add crawl job progress drawer with phase tracking and live logs - Add phase-aware progress reporting across all crawl phases (splitting, fetching, filtering, processing) with per-step counters - Add TaskProgressDrawer component with phase timeline stepper, detail counters, progress bar with ETA, and live worker log viewer - Add on_step_complete callback to ListingProcessor for granular tracking of details/images/OCR steps - Extend QuerySplitter on_progress callback with structured counter data - Capture celery worker logs via ring buffer handler and inject into task state updates for frontend display - Guard taskResult updates with phase presence check to prevent drawer from blanking during state transitions	2026-02-06 22:37:53 +00:00
Viktor Barzin	d205d15c74	Add services layer, tests, streaming UI, and cleanup legacy code	2026-02-06 20:55:10 +00:00
Viktor Barzin	5514fa6381	Remove 1000-result limit, add Redis caching and virtual scrolling - Remove hard-coded limit=1000 default from listing_geojson and streaming endpoints, allowing all matching results to be returned - Add Redis caching service (db=2, 30min TTL) that caches query results as Redis Lists for fast re-queries with reduced DB load - Integrate cache into streaming endpoint: serve from cache on hit, populate cache on miss during DB streaming - Invalidate cache after scrape completes (both success and no-new-listings) - Replace ScrollArea with react-virtuoso in ListView for virtual scrolling, keeping only ~20-30 DOM nodes regardless of list size - Handle metadata streaming message to show "0 / N" progress from start - Throttle frontend state updates with requestAnimationFrame to prevent UI jank from rapid re-renders during cached response streaming	2026-02-06 20:47:36 +00:00
Viktor Barzin	f880664a98	Add throttling detection and circuit breaker for Rightmove scraper	2026-02-06 20:47:36 +00:00
Viktor Barzin	e8293c6042	Add intelligent query splitting to maximize Rightmove data extraction	2026-02-06 20:47:36 +00:00
Viktor Barzin	835a2a9d53	Fix stuck Celery tasks and add purge all tasks functionality	2026-02-01 20:40:07 +00:00

7 commits