progression V2 #57

Merged

Lars merged 20 commits from develop into main

2026-06-13 16:34:09 +02:00

Author	SHA1	Message	Date
Lars	7265cd5a01	Add findings_stale field to GraphPlanningRoadmapArtifact and update ProgressionGraphEditor for state management All checks were successful Deploy Development / deploy (push) Successful in 43s Details Test Suite / pytest-backend (push) Successful in 45s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 34s Details Test Suite / playwright-tests (push) Successful in 1m34s Details - Introduced `findings_stale` field in `GraphPlanningRoadmapArtifact` to track the freshness of findings. - Updated `ProgressionGraphEditor` to manage `findingsStale` state across various functions, ensuring accurate representation of evaluation status. - Modified related utility functions and tests to accommodate the new state, enhancing overall functionality and user feedback in the progression graph management process.	2026-06-13 16:29:17 +02:00
Lars	5e5f4ca8d4	Enhance Progression Findings and Graph Editor with Evaluation Staleness Handling Some checks failed Deploy Development / deploy (push) Successful in 40s Details Test Suite / pytest-backend (push) Successful in 47s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 13s Details Test Suite / k6 /health Baseline (push) Failing after 2s Details Test Suite / playwright-tests (push) Successful in 1m19s Details - Added `evaluationStale` state to `ProgressionGraphEditor` and `ProgressionFindingsPanel` to track the freshness of evaluations. - Updated UI to display a warning when evaluations are stale, prompting users to re-evaluate the graph. - Modified loading and evaluation functions to manage the `evaluationStale` state effectively, ensuring accurate user feedback during the evaluation process. - Improved user notifications regarding the need for re-evaluation after changes to the graph.	2026-06-13 16:23:04 +02:00
Lars	f0e581a9f5	Implement Off-Topic Slot Gap Specification and Unified Slot Review Enhancements All checks were successful Deploy Development / deploy (push) Successful in 42s Details Test Suite / pytest-backend (push) Successful in 43s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 13s Details Test Suite / k6 /health Baseline (push) Successful in 33s Details Test Suite / playwright-tests (push) Successful in 1m34s Details - Introduced `_build_off_topic_slot_gap_spec` to generate specifications for off-topic slots, improving the handling of filled but thematically inappropriate slots. - Added `_build_unified_slot_review_entry` to streamline the review process for slots, incorporating various parameters for better evaluation and suggestions. - Enhanced existing logic in slot management to improve the robustness of path evaluations and user feedback. - Added tests for the new off-topic slot gap specification to ensure functionality and correctness.	2026-06-13 12:43:59 +02:00
Lars	cd457e3ea0	Enhance Slot Evaluation and Scoring Mechanisms All checks were successful Deploy Development / deploy (push) Successful in 46s Details Test Suite / pytest-backend (push) Successful in 44s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 33s Details Test Suite / playwright-tests (push) Successful in 1m26s Details - Introduced new functions `_off_topic_semantic_scores_by_slot` and `_score_exercise_stage_fit_for_spec` to improve the evaluation of off-topic steps and exercise stage fit, enhancing the quality assessment process. - Updated `_run_unified_slot_improvement_review` to incorporate off-topic scores and exercise stage fit scoring, refining the decision-making process for slot suggestions. - Enhanced existing logic to streamline the handling of slot scores and improve the overall robustness of slot management in path evaluations.	2026-06-13 12:33:16 +02:00
Lars	e9bf5bd1a5	Enhance Path Evaluation and Slot Management Features All checks were successful Deploy Development / deploy (push) Successful in 44s Details Test Suite / pytest-backend (push) Successful in 44s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 34s Details Test Suite / playwright-tests (push) Successful in 1m13s Details - Introduced `_parse_slot_refs_from_text` to extract and convert slot references from text, improving the handling of user input in path evaluations. - Updated `_problematic_slots_from_path_qa` to utilize the new parsing function, enhancing the identification of problematic slots based on various hints and issues. - Enhanced `ProgressionGraphEditor` and `ProgressionOptimizeCompareModal` to better display identified problem slots and their associated reasons, improving user feedback during evaluations. - Added tests for new parsing functionality and its integration with existing slot management processes, ensuring robustness in slot reference handling.	2026-06-13 12:17:58 +02:00
Lars	3468b2066e	Enhance Path QA and Progression Review Logic All checks were successful Deploy Development / deploy (push) Successful in 44s Details Test Suite / pytest-backend (push) Successful in 44s Details Test Suite / lint-backend (push) Successful in 1s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 41s Details Test Suite / playwright-tests (push) Successful in 1m27s Details - Introduced `_resolve_hint_major_index` to accurately map hints to major step indices, improving the handling of optimization hints in path evaluations. - Added `_problematic_slots_from_path_qa` to identify and categorize problematic slots based on baseline QA, enhancing the quality assessment process. - Updated `_slot_suggestion_accepted` to incorporate new parameters for slot problems and stage specifications, refining the decision-making process for slot suggestions. - Enhanced `ProgressionGraphEditor` to improve user notifications regarding identified issues and suggestions, ensuring clearer communication of path evaluation results. - Modified `buildProgressionComparePayload` and `buildUnifiedSlotReviewComparePayload` to support baseline evaluations, streamlining the comparison process for proposed paths.	2026-06-13 10:39:52 +02:00
Lars	a1e4ad66df	Implement Quick Evaluation and Quality Scoring for Path QA All checks were successful Deploy Development / deploy (push) Successful in 40s Details Test Suite / pytest-backend (push) Successful in 44s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 33s Details Test Suite / playwright-tests (push) Successful in 1m11s Details - Added `_quick_evaluate_steps_qa` function to streamline path quality assessment without recursive API calls, enhancing performance for slot comparisons. - Introduced `compute_deterministic_path_quality_score` to provide a heuristic quality score based on gaps and off-topic steps, improving evaluation accuracy. - Updated `_run_unified_slot_improvement_review` to utilize the new quick evaluation method, optimizing the review process and integrating quality scoring. - Enhanced `build_path_qa_summary` to include quality score calculations, ensuring comprehensive feedback on path evaluations. - Refactored related functions for improved clarity and efficiency in handling path quality assessments.	2026-06-13 10:27:07 +02:00
Lars	85fccdd093	Enhance Progression Path Comparison and Slot Evaluation Features All checks were successful Deploy Development / deploy (push) Successful in 43s Details Test Suite / pytest-backend (push) Successful in 44s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 33s Details Test Suite / playwright-tests (push) Successful in 1m12s Details - Introduced new fields in `ProgressionPathSuggestRequest` for baseline evaluation and incremental scoring, improving the assessment of proposed paths. - Implemented `_apply_slot_diff_to_steps` and `_score_incremental_slot_diffs` functions to manage slot differences and evaluate their impact on quality scores. - Updated `ProgressionGraphEditor` to streamline the match comparison flow, integrating new evaluation parameters and improving user notifications. - Enhanced `ProgressionOptimizeCompareModal` to better display proposed path suggestions, including pro/con evaluations and quality delta metrics. - Refactored utility functions for clearer handling of slot differences and improved overall data management in the progression graph editor.	2026-06-13 10:11:10 +02:00
Lars	19bbcdaf50	Refactor Progression Comparison Logic and Enhance UI Components All checks were successful Deploy Development / deploy (push) Successful in 45s Details Test Suite / pytest-backend (push) Successful in 43s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 34s Details Test Suite / playwright-tests (push) Successful in 1m19s Details - Introduced new utility functions for comparing slot differences, including `compareDiffKind`, `annotateCompareDiffKinds`, and various filtering functions to streamline the comparison process. - Updated `ProgressionGraphEditor` to utilize the new comparison logic, improving the handling of slot differences and user notifications. - Enhanced `ProgressionOptimizeCompareModal` to better manage proposed path suggestions, including clearer messaging and improved selection handling for optional replacements. - Adjusted frontend components to reflect changes in comparison logic, ensuring a more intuitive user experience in managing progression paths.	2026-06-13 09:02:15 +02:00
Lars	cec96ae473	Implement Progression Comparison Logic and Refactor Fetching Methods All checks were successful Deploy Development / deploy (push) Successful in 44s Details Test Suite / pytest-backend (push) Successful in 44s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 13s Details Test Suite / k6 /health Baseline (push) Successful in 34s Details Test Suite / playwright-tests (push) Successful in 1m13s Details - Introduced `buildProgressionComparePayload` to create a structured comparison response from baseline and proposed evaluation results, enhancing clarity in slot differences. - Refactored `fetchMatchCompare` to `fetchFullMatch` for improved clarity and functionality in fetching progression paths. - Updated `runMatchCompareFlow` to streamline the evaluation process, integrating baseline and match results for a comprehensive comparison. - Enhanced utility functions for managing slot differences and gap fill offers, improving overall data handling in the progression graph editor. - Adjusted frontend components to reflect these changes, ensuring a more intuitive user experience in managing progression paths.	2026-06-13 08:46:10 +02:00
Lars	53f1c7161f	Refactor AI Gap Fill and Progression Path Evaluation Logic Some checks failed Deploy Development / deploy (push) Successful in 45s Details Test Suite / pytest-backend (push) Successful in 44s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 13s Details Test Suite / k6 /health Baseline (push) Successful in 34s Details Test Suite / playwright-tests (push) Has been cancelled Details - Removed the `try_suggest_ai_stage_step` function from `_enrich_roadmap_unfilled_gap_offers`, simplifying the gap fill offer generation process. - Updated `_run_evaluate_only_path_qa` and `suggest_progression_path` to disable AI calls and proposals, enhancing control over evaluation parameters. - Adjusted `ProgressionGraphEditor` to reflect changes in API requests, ensuring consistent handling of evaluation data. - Added a new test to validate the behavior of proposed QA when no slot differences are present, improving test coverage for comparison logic.	2026-06-13 08:43:02 +02:00
Lars	89c6780294	Enhance AI Gap Fill Logic and Progression Path Handling All checks were successful Deploy Development / deploy (push) Successful in 49s Details Test Suite / pytest-backend (push) Successful in 45s Details Test Suite / lint-backend (push) Successful in 1s Details Test Suite / build-frontend (push) Successful in 15s Details Test Suite / k6 /health Baseline (push) Successful in 33s Details Test Suite / playwright-tests (push) Successful in 1m14s Details - Integrated `try_suggest_ai_stage_step` to suggest AI-generated gap fill steps based on user input, improving the automation of the planning process. - Updated `_enrich_roadmap_unfilled_gap_offers` to conditionally include AI gap fill proposals, enhancing the offer generation logic. - Implemented `_merge_gap_fill_offers_from_steps` to consolidate gap fill offers from various steps, ensuring a comprehensive list of available offers. - Modified `ProgressionGraphEditor` to utilize the new merging logic for gap fill offers, improving the user experience in managing offers. - Enhanced utility functions to streamline the collection and filtering of gap fill offers from API responses. - Bumped version to reflect the new features and improvements.	2026-06-13 08:36:53 +02:00
Lars	3f130aa8ad	Refactor Progression Path Evaluation and Comparison Logic All checks were successful Deploy Development / deploy (push) Successful in 45s Details Test Suite / pytest-backend (push) Successful in 44s Details Test Suite / lint-backend (push) Successful in 1s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 34s Details Test Suite / playwright-tests (push) Successful in 1m22s Details - Updated `suggest_progression_path` to utilize `evaluate_steps` for improved validation, ensuring at least one evaluation step is provided. - Modified frontend components to enhance user experience in the comparison process, including clearer messaging and improved dialog handling. - Adjusted `ProgressionGraphEditor` to streamline the comparison flow and integrate new evaluation parameters. - Enhanced `ProgressionOptimizeCompareModal` to reflect changes in comparison logic, allowing for better user interaction with proposed path suggestions. - Bumped version to reflect the new features and improvements.	2026-06-13 08:17:59 +02:00
Lars	69ce3f6975	Enhance Rematch Suggestion Logic and Progression Path Evaluation All checks were successful Deploy Development / deploy (push) Successful in 41s Details Test Suite / pytest-backend (push) Successful in 45s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 33s Details Test Suite / playwright-tests (push) Successful in 1m14s Details - Introduced `_baseline_slot_accepts_rematch_suggestion` to filter out filled or invalid slots from rematch suggestions, improving the accuracy of rematch logic. - Updated `_build_rematch_suggestion_diffs` to skip non-eligible baseline slots, streamlining the rematch suggestion process. - Added `_evaluate_steps_for_compare_qa` to evaluate steps against the current state, enhancing the quality assessment during progression path suggestions. - Modified `_build_progression_compare_response` to ensure proper handling of slot differences and quality scores, improving response clarity. - Updated frontend components to reflect changes in rematch handling and evaluation logic. - Bumped version to reflect the new features and improvements.	2026-06-13 08:02:44 +02:00
Lars	dccb065181	Enhance Slot Difference Annotation and Rematch Suggestion Logic All checks were successful Deploy Development / deploy (push) Successful in 43s Details Test Suite / pytest-backend (push) Successful in 44s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 34s Details Test Suite / playwright-tests (push) Successful in 1m13s Details - Introduced `_annotate_slot_diffs` to mark trivial ID swaps in slot differences, improving clarity in comparison results. - Added `_actionable_slot_diffs` to filter out non-actionable differences, streamlining the evaluation process. - Implemented `_build_rematch_suggestion_diffs` to generate suggestions based on rematch logs, enhancing the path optimization workflow. - Updated `_build_progression_compare_response` to incorporate actionable slot differences and rematch suggestions, improving the response structure. - Enhanced frontend components to display rematch suggestions and handle trivial differences more effectively. - Bumped version to reflect the new features and improvements.	2026-06-13 07:55:47 +02:00
Lars	e828a5da32	Enhance Progression Path Evaluation and Comparison Logic All checks were successful Deploy Development / deploy (push) Successful in 45s Details Test Suite / pytest-backend (push) Successful in 45s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 15s Details Test Suite / k6 /health Baseline (push) Successful in 38s Details Test Suite / playwright-tests (push) Successful in 1m23s Details - Introduced `_steps_to_evaluate_payloads` to convert path steps into evaluation payloads for improved quality assessments. - Updated `_build_progression_compare_response` to include a new `proposed_eval` parameter, allowing for fair quality assessment comparisons. - Enhanced `ProgressionGraphEditor` to utilize the new pipeline quality assessment data. - Modified `ProgressionOptimizeCompareModal` to display detailed comparison results, including handling of trivial slot differences and optimization hints. - Bumped version to reflect the new features and improvements.	2026-06-13 07:44:01 +02:00
Lars	5bca5ef9eb	Enhance Progression Path Evaluation and Optimization Features All checks were successful Deploy Development / deploy (push) Successful in 43s Details Test Suite / pytest-backend (push) Successful in 45s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 13s Details Test Suite / k6 /health Baseline (push) Successful in 34s Details Test Suite / playwright-tests (push) Successful in 1m18s Details - Updated `suggest_progression_path` to include additional evaluation parameters, allowing for more comprehensive path assessments. - Introduced `PathQaPipelineDetails` component to display detailed quality assessment metrics, including rematch and refine logs, in the frontend. - Enhanced `ProgressionGraphEditor` to manage proposed path evaluations and integrate quality assessment results into the draft workflow. - Improved `ProgressionOptimizeCompareModal` to present optimization hints and quality tier information for proposed paths. - Bumped version to reflect the new features and improvements.	2026-06-12 13:33:36 +02:00
Lars	5ed06002d9	Implement Comparison Logic for Progression Path Suggestions All checks were successful Deploy Development / deploy (push) Successful in 44s Details Test Suite / pytest-backend (push) Successful in 45s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 14s Details Test Suite / k6 /health Baseline (push) Successful in 33s Details Test Suite / playwright-tests (push) Successful in 1m18s Details - Added `compare_with_assignments` flag to `ProgressionPathSuggestRequest` to enable comparison of proposed paths with existing slot assignments. - Introduced `_assignment_preservation_active` function to determine if existing assignments should be preserved during path suggestions. - Enhanced `suggest_progression_path` to handle comparison logic, including validation for minimum slot assignments required for comparison. - Implemented `_build_progression_compare_response` to structure the response for comparison results, including slot differences and quality scores. - Updated frontend components to support new comparison features, including handling of slot assignments and optimization comparisons. - Bumped version to reflect the new features and improvements.	2026-06-12 13:22:04 +02:00
Lars	b8f65e04c5	Enhance Rematch Logic and Slot Filtering in Planning Path All checks were successful Deploy Development / deploy (push) Successful in 41s Details Test Suite / pytest-backend (push) Successful in 45s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 13s Details Test Suite / k6 /health Baseline (push) Successful in 33s Details Test Suite / playwright-tests (push) Successful in 1m13s Details - Introduced `filter_rematch_slot_indices` to exclude preserved slots from rematching, improving the accuracy of slot assignments. - Added `_slot_priority_for_rematch` to prioritize existing slot assignments during rematching, enhancing the robustness of the rematch process. - Updated `_run_roadmap_rematch_loop` to utilize the new filtering and prioritization logic, ensuring better handling of rematch scenarios. - Enhanced tests in `test_planning_path_rematch.py` to validate the new filtering behavior and ensure correct exercise restoration when not rejected. - Bumped version to reflect the new features and improvements.	2026-06-12 12:33:00 +02:00
Lars	f3710ac0a1	Enhance Planning Catalog Context Integration in Progression Path All checks were successful Deploy Development / deploy (push) Successful in 45s Details Test Suite / pytest-backend (push) Successful in 43s Details Test Suite / lint-backend (push) Successful in 0s Details Test Suite / build-frontend (push) Successful in 13s Details Test Suite / k6 /health Baseline (push) Successful in 33s Details Test Suite / playwright-tests (push) Successful in 1m15s Details - Updated `PROJECT_STATUS.md` to reflect the addition of the Planning AI Progression Graph and its context in the roadmap. - Enhanced `DOMAIN_MODEL.md` with details on the new `planning_catalog_context` features, allowing trainers to manage curriculum stages and context. - Added tests in `test_planning_catalog_context.py` to validate the separation of LLM highlights from fix hints during QA processes. - Updated `HANDOVER.md` and `PLANNING_KI_ROADMAP.md` to reflect the latest app version and improvements in the planning context. - Enhanced frontend components to support the new planning catalog context, including updates to `ExerciseProgressionPathBuilder` and `ProgressionGraphEditor`. - Bumped version to 0.8.233 to reflect the new features and improvements.	2026-06-12 12:25:52 +02:00

progression V2 #57

20 Commits