Commit graph

7 commits

Author SHA1 Message Date
Kadir
f98cd02696 adding better exception messages and interrupt free crawling 2024-05-06 18:54:55 +01:00
Kadir
874fed3e8f format and committing notebook 2024-04-05 11:38:55 +01:00
Kadir
5305451fe8 changing detail downloads to prefiltering first. Making the progress bar more accurate and frontloading 2024-04-01 20:28:37 +02:00
Kadir
acc67192c9 add counter to 2_dump_details to understand how many new are crawled 2024-03-30 19:24:03 +01:00
Kadir
d777558b34 ruff format 2024-03-25 20:48:48 +00:00
Kadir
36258d877f crawling for 3 and refactoring to allow incremental crawls 2024-03-11 14:43:53 +00:00
Kadir
508aa02812 Real crawling scripts and floorplan detection
1. get all listings
2. get all detail jsons
3. get all images
4. get all floorplans
5. detecting floorplans

Also updating dependencies for huggingface etc.
2024-03-10 18:49:39 +00:00