Commit Graph

  • 652261b774 Added support for known hub URLs in scout_logic.py to handle Playwright timeouts and errors more effectively. Updated fetching logic to prioritize known URLs when encountering issues, enhancing reliability in link extraction. main Lars 2026-01-31 18:42:42 +0100
  • 8000642eae Updated scout_logic.py to clarify the strategy for identifying the entry or overview page (Hub) for research reports, enhancing the prompt for link analysis to focus on Hub URLs rather than final download pages. Lars 2026-01-31 18:39:08 +0100
  • f7b328b7f2 Refined timeout strategy in scout_logic.py for URL fetching, introducing separate timeouts for 'commit' and 'domcontentloaded' states, and enhanced logging for better error visibility during page loading attempts. Lars 2026-01-31 18:33:44 +0100
  • beb80e9eaf Refactored timeout handling in scout_logic.py to improve URL fetching reliability, added detailed logging for error tracking, and implemented a total timeout for Playwright operations to prevent indefinite hangs. Lars 2026-01-31 18:28:26 +0100
  • b3e9a6455b Enhanced main.py and scout_logic.py with improved timeout handling for URL fetching, added logging for better request tracking, and optimized page loading strategy to prevent hangs on heavy pages. Lars 2026-01-31 18:25:23 +0100
  • 46b59d2c5c Updated docker-compose.yml to clarify port mapping, modified Dockerfile.worker to enable access logging, and added logging functionality in main.py for request tracking. Lars 2026-01-31 18:19:41 +0100
  • 3542e4564b Updated docker-compose.yml to change the port mapping from 8000 to 8010 for the worker service. Lars 2026-01-31 18:15:15 +0100
  • 9dd44af2d4 Updated scout_logic.py to use the new Stealth class for bot detection evasion, replacing the previous stealth_async function call. Lars 2026-01-31 18:13:40 +0100
  • a18801a6aa Refactored import statement for playwright_stealth in scout_logic.py to align with updated package structure. Lars 2026-01-31 18:11:33 +0100
  • 9c5f769455 Added playwright-stealth dependency and refactored link fetching logic in scout_logic.py to enhance bot detection evasion and implement HTTP/2 fallback handling. Lars 2026-01-31 18:08:40 +0100
  • afee46933f Enhanced scout_logic.py with improved browser configuration to bypass bot detection, added URL normalization functions, and implemented robust error handling for fetching links. Lars 2026-01-31 18:04:49 +0100
  • 6e813daf69 Updated project structure and added initial configuration files. Lars 2026-01-31 17:36:48 +0100
  • bd0e602b09 initialer Setup Lars 2026-01-31 11:56:20 +0100
  • 24964cd507 readme hinzugefügt Lars 2026-01-31 12:02:18 +0100