progression V2 #57

Lars · 2026-06-13T16:34:05+02:00

Lars commented

2026-06-13 16:34:05 +02:00

No description provided.

Lars added 20 commits 2026-06-13 16:34:05 +02:00

Enhance Planning Catalog Context Integration in Progression Path

Deploy Development / deploy (push) Successful in 45s

Details

Test Suite / pytest-backend (push) Successful in 43s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 13s

Details

Test Suite / k6 /health Baseline (push) Successful in 33s

Details

Test Suite / playwright-tests (push) Successful in 1m15s

Details

f3710ac0a1

- Updated `PROJECT_STATUS.md` to reflect the addition of the Planning AI Progression Graph and its context in the roadmap.
- Enhanced `DOMAIN_MODEL.md` with details on the new `planning_catalog_context` features, allowing trainers to manage curriculum stages and context.
- Added tests in `test_planning_catalog_context.py` to validate the separation of LLM highlights from fix hints during QA processes.
- Updated `HANDOVER.md` and `PLANNING_KI_ROADMAP.md` to reflect the latest app version and improvements in the planning context.
- Enhanced frontend components to support the new planning catalog context, including updates to `ExerciseProgressionPathBuilder` and `ProgressionGraphEditor`.
- Bumped version to 0.8.233 to reflect the new features and improvements.

Enhance Rematch Logic and Slot Filtering in Planning Path

Deploy Development / deploy (push) Successful in 41s

Details

Test Suite / pytest-backend (push) Successful in 45s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 13s

Details

Test Suite / k6 /health Baseline (push) Successful in 33s

Details

Test Suite / playwright-tests (push) Successful in 1m13s

Details

b8f65e04c5

- Introduced `filter_rematch_slot_indices` to exclude preserved slots from rematching, improving the accuracy of slot assignments.
- Added `_slot_priority_for_rematch` to prioritize existing slot assignments during rematching, enhancing the robustness of the rematch process.
- Updated `_run_roadmap_rematch_loop` to utilize the new filtering and prioritization logic, ensuring better handling of rematch scenarios.
- Enhanced tests in `test_planning_path_rematch.py` to validate the new filtering behavior and ensure correct exercise restoration when not rejected.
- Bumped version to reflect the new features and improvements.

Implement Comparison Logic for Progression Path Suggestions

Deploy Development / deploy (push) Successful in 44s

Details

Test Suite / pytest-backend (push) Successful in 45s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 33s

Details

Test Suite / playwright-tests (push) Successful in 1m18s

Details

5ed06002d9

- Added `compare_with_assignments` flag to `ProgressionPathSuggestRequest` to enable comparison of proposed paths with existing slot assignments.
- Introduced `_assignment_preservation_active` function to determine if existing assignments should be preserved during path suggestions.
- Enhanced `suggest_progression_path` to handle comparison logic, including validation for minimum slot assignments required for comparison.
- Implemented `_build_progression_compare_response` to structure the response for comparison results, including slot differences and quality scores.
- Updated frontend components to support new comparison features, including handling of slot assignments and optimization comparisons.
- Bumped version to reflect the new features and improvements.

Enhance Progression Path Evaluation and Optimization Features

Deploy Development / deploy (push) Successful in 43s

Details

Test Suite / pytest-backend (push) Successful in 45s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 13s

Details

Test Suite / k6 /health Baseline (push) Successful in 34s

Details

Test Suite / playwright-tests (push) Successful in 1m18s

Details

5bca5ef9eb

- Updated `suggest_progression_path` to include additional evaluation parameters, allowing for more comprehensive path assessments.
- Introduced `PathQaPipelineDetails` component to display detailed quality assessment metrics, including rematch and refine logs, in the frontend.
- Enhanced `ProgressionGraphEditor` to manage proposed path evaluations and integrate quality assessment results into the draft workflow.
- Improved `ProgressionOptimizeCompareModal` to present optimization hints and quality tier information for proposed paths.
- Bumped version to reflect the new features and improvements.

Enhance Progression Path Evaluation and Comparison Logic

Deploy Development / deploy (push) Successful in 45s

Details

Test Suite / pytest-backend (push) Successful in 45s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 15s

Details

Test Suite / k6 /health Baseline (push) Successful in 38s

Details

Test Suite / playwright-tests (push) Successful in 1m23s

Details

e828a5da32

- Introduced `_steps_to_evaluate_payloads` to convert path steps into evaluation payloads for improved quality assessments.
- Updated `_build_progression_compare_response` to include a new `proposed_eval` parameter, allowing for fair quality assessment comparisons.
- Enhanced `ProgressionGraphEditor` to utilize the new pipeline quality assessment data.
- Modified `ProgressionOptimizeCompareModal` to display detailed comparison results, including handling of trivial slot differences and optimization hints.
- Bumped version to reflect the new features and improvements.

Enhance Slot Difference Annotation and Rematch Suggestion Logic

Deploy Development / deploy (push) Successful in 43s

Details

Test Suite / pytest-backend (push) Successful in 44s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 34s

Details

Test Suite / playwright-tests (push) Successful in 1m13s

Details

dccb065181

- Introduced `_annotate_slot_diffs` to mark trivial ID swaps in slot differences, improving clarity in comparison results.
- Added `_actionable_slot_diffs` to filter out non-actionable differences, streamlining the evaluation process.
- Implemented `_build_rematch_suggestion_diffs` to generate suggestions based on rematch logs, enhancing the path optimization workflow.
- Updated `_build_progression_compare_response` to incorporate actionable slot differences and rematch suggestions, improving the response structure.
- Enhanced frontend components to display rematch suggestions and handle trivial differences more effectively.
- Bumped version to reflect the new features and improvements.

Enhance Rematch Suggestion Logic and Progression Path Evaluation

Deploy Development / deploy (push) Successful in 41s

Details

Test Suite / pytest-backend (push) Successful in 45s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 33s

Details

Test Suite / playwright-tests (push) Successful in 1m14s

Details

69ce3f6975

- Introduced `_baseline_slot_accepts_rematch_suggestion` to filter out filled or invalid slots from rematch suggestions, improving the accuracy of rematch logic.
- Updated `_build_rematch_suggestion_diffs` to skip non-eligible baseline slots, streamlining the rematch suggestion process.
- Added `_evaluate_steps_for_compare_qa` to evaluate steps against the current state, enhancing the quality assessment during progression path suggestions.
- Modified `_build_progression_compare_response` to ensure proper handling of slot differences and quality scores, improving response clarity.
- Updated frontend components to reflect changes in rematch handling and evaluation logic.
- Bumped version to reflect the new features and improvements.

Refactor Progression Path Evaluation and Comparison Logic

Deploy Development / deploy (push) Successful in 45s

Details

Test Suite / pytest-backend (push) Successful in 44s

Details

Test Suite / lint-backend (push) Successful in 1s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 34s

Details

Test Suite / playwright-tests (push) Successful in 1m22s

Details

3f130aa8ad

- Updated `suggest_progression_path` to utilize `evaluate_steps` for improved validation, ensuring at least one evaluation step is provided.
- Modified frontend components to enhance user experience in the comparison process, including clearer messaging and improved dialog handling.
- Adjusted `ProgressionGraphEditor` to streamline the comparison flow and integrate new evaluation parameters.
- Enhanced `ProgressionOptimizeCompareModal` to reflect changes in comparison logic, allowing for better user interaction with proposed path suggestions.
- Bumped version to reflect the new features and improvements.

Enhance AI Gap Fill Logic and Progression Path Handling

Deploy Development / deploy (push) Successful in 49s

Details

Test Suite / pytest-backend (push) Successful in 45s

Details

Test Suite / lint-backend (push) Successful in 1s

Details

Test Suite / build-frontend (push) Successful in 15s

Details

Test Suite / k6 /health Baseline (push) Successful in 33s

Details

Test Suite / playwright-tests (push) Successful in 1m14s

Details

89c6780294

- Integrated `try_suggest_ai_stage_step` to suggest AI-generated gap fill steps based on user input, improving the automation of the planning process.
- Updated `_enrich_roadmap_unfilled_gap_offers` to conditionally include AI gap fill proposals, enhancing the offer generation logic.
- Implemented `_merge_gap_fill_offers_from_steps` to consolidate gap fill offers from various steps, ensuring a comprehensive list of available offers.
- Modified `ProgressionGraphEditor` to utilize the new merging logic for gap fill offers, improving the user experience in managing offers.
- Enhanced utility functions to streamline the collection and filtering of gap fill offers from API responses.
- Bumped version to reflect the new features and improvements.

Refactor AI Gap Fill and Progression Path Evaluation Logic

Deploy Development / deploy (push) Successful in 45s

Details

Test Suite / pytest-backend (push) Successful in 44s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 13s

Details

Test Suite / k6 /health Baseline (push) Successful in 34s

Details

Test Suite / playwright-tests (push) Has been cancelled

Details

53f1c7161f

- Removed the `try_suggest_ai_stage_step` function from `_enrich_roadmap_unfilled_gap_offers`, simplifying the gap fill offer generation process.
- Updated `_run_evaluate_only_path_qa` and `suggest_progression_path` to disable AI calls and proposals, enhancing control over evaluation parameters.
- Adjusted `ProgressionGraphEditor` to reflect changes in API requests, ensuring consistent handling of evaluation data.
- Added a new test to validate the behavior of proposed QA when no slot differences are present, improving test coverage for comparison logic.

Implement Progression Comparison Logic and Refactor Fetching Methods

Deploy Development / deploy (push) Successful in 44s

Details

Test Suite / pytest-backend (push) Successful in 44s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 13s

Details

Test Suite / k6 /health Baseline (push) Successful in 34s

Details

Test Suite / playwright-tests (push) Successful in 1m13s

Details

cec96ae473

- Introduced `buildProgressionComparePayload` to create a structured comparison response from baseline and proposed evaluation results, enhancing clarity in slot differences.
- Refactored `fetchMatchCompare` to `fetchFullMatch` for improved clarity and functionality in fetching progression paths.
- Updated `runMatchCompareFlow` to streamline the evaluation process, integrating baseline and match results for a comprehensive comparison.
- Enhanced utility functions for managing slot differences and gap fill offers, improving overall data handling in the progression graph editor.
- Adjusted frontend components to reflect these changes, ensuring a more intuitive user experience in managing progression paths.

Refactor Progression Comparison Logic and Enhance UI Components

Deploy Development / deploy (push) Successful in 45s

Details

Test Suite / pytest-backend (push) Successful in 43s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 34s

Details

Test Suite / playwright-tests (push) Successful in 1m19s

Details

19bbcdaf50

- Introduced new utility functions for comparing slot differences, including `compareDiffKind`, `annotateCompareDiffKinds`, and various filtering functions to streamline the comparison process.
- Updated `ProgressionGraphEditor` to utilize the new comparison logic, improving the handling of slot differences and user notifications.
- Enhanced `ProgressionOptimizeCompareModal` to better manage proposed path suggestions, including clearer messaging and improved selection handling for optional replacements.
- Adjusted frontend components to reflect changes in comparison logic, ensuring a more intuitive user experience in managing progression paths.

Enhance Progression Path Comparison and Slot Evaluation Features

Deploy Development / deploy (push) Successful in 43s

Details

Test Suite / pytest-backend (push) Successful in 44s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 33s

Details

Test Suite / playwright-tests (push) Successful in 1m12s

Details

85fccdd093

- Introduced new fields in `ProgressionPathSuggestRequest` for baseline evaluation and incremental scoring, improving the assessment of proposed paths.
- Implemented `_apply_slot_diff_to_steps` and `_score_incremental_slot_diffs` functions to manage slot differences and evaluate their impact on quality scores.
- Updated `ProgressionGraphEditor` to streamline the match comparison flow, integrating new evaluation parameters and improving user notifications.
- Enhanced `ProgressionOptimizeCompareModal` to better display proposed path suggestions, including pro/con evaluations and quality delta metrics.
- Refactored utility functions for clearer handling of slot differences and improved overall data management in the progression graph editor.

Implement Quick Evaluation and Quality Scoring for Path QA

Deploy Development / deploy (push) Successful in 40s

Details

Test Suite / pytest-backend (push) Successful in 44s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 33s

Details

Test Suite / playwright-tests (push) Successful in 1m11s

Details

a1e4ad66df

- Added `_quick_evaluate_steps_qa` function to streamline path quality assessment without recursive API calls, enhancing performance for slot comparisons.
- Introduced `compute_deterministic_path_quality_score` to provide a heuristic quality score based on gaps and off-topic steps, improving evaluation accuracy.
- Updated `_run_unified_slot_improvement_review` to utilize the new quick evaluation method, optimizing the review process and integrating quality scoring.
- Enhanced `build_path_qa_summary` to include quality score calculations, ensuring comprehensive feedback on path evaluations.
- Refactored related functions for improved clarity and efficiency in handling path quality assessments.

Enhance Path QA and Progression Review Logic

Deploy Development / deploy (push) Successful in 44s

Details

Test Suite / pytest-backend (push) Successful in 44s

Details

Test Suite / lint-backend (push) Successful in 1s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 41s

Details

Test Suite / playwright-tests (push) Successful in 1m27s

Details

3468b2066e

- Introduced `_resolve_hint_major_index` to accurately map hints to major step indices, improving the handling of optimization hints in path evaluations.
- Added `_problematic_slots_from_path_qa` to identify and categorize problematic slots based on baseline QA, enhancing the quality assessment process.
- Updated `_slot_suggestion_accepted` to incorporate new parameters for slot problems and stage specifications, refining the decision-making process for slot suggestions.
- Enhanced `ProgressionGraphEditor` to improve user notifications regarding identified issues and suggestions, ensuring clearer communication of path evaluation results.
- Modified `buildProgressionComparePayload` and `buildUnifiedSlotReviewComparePayload` to support baseline evaluations, streamlining the comparison process for proposed paths.

Enhance Path Evaluation and Slot Management Features

Deploy Development / deploy (push) Successful in 44s

Details

Test Suite / pytest-backend (push) Successful in 44s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 34s

Details

Test Suite / playwright-tests (push) Successful in 1m13s

Details

e9bf5bd1a5

- Introduced `_parse_slot_refs_from_text` to extract and convert slot references from text, improving the handling of user input in path evaluations.
- Updated `_problematic_slots_from_path_qa` to utilize the new parsing function, enhancing the identification of problematic slots based on various hints and issues.
- Enhanced `ProgressionGraphEditor` and `ProgressionOptimizeCompareModal` to better display identified problem slots and their associated reasons, improving user feedback during evaluations.
- Added tests for new parsing functionality and its integration with existing slot management processes, ensuring robustness in slot reference handling.

Enhance Slot Evaluation and Scoring Mechanisms

Deploy Development / deploy (push) Successful in 46s

Details

Test Suite / pytest-backend (push) Successful in 44s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 33s

Details

Test Suite / playwright-tests (push) Successful in 1m26s

Details

cd457e3ea0

- Introduced new functions `_off_topic_semantic_scores_by_slot` and `_score_exercise_stage_fit_for_spec` to improve the evaluation of off-topic steps and exercise stage fit, enhancing the quality assessment process.
- Updated `_run_unified_slot_improvement_review` to incorporate off-topic scores and exercise stage fit scoring, refining the decision-making process for slot suggestions.
- Enhanced existing logic to streamline the handling of slot scores and improve the overall robustness of slot management in path evaluations.

Implement Off-Topic Slot Gap Specification and Unified Slot Review Enhancements

Deploy Development / deploy (push) Successful in 42s

Details

Test Suite / pytest-backend (push) Successful in 43s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 13s

Details

Test Suite / k6 /health Baseline (push) Successful in 33s

Details

Test Suite / playwright-tests (push) Successful in 1m34s

Details

f0e581a9f5

- Introduced `_build_off_topic_slot_gap_spec` to generate specifications for off-topic slots, improving the handling of filled but thematically inappropriate slots.
- Added `_build_unified_slot_review_entry` to streamline the review process for slots, incorporating various parameters for better evaluation and suggestions.
- Enhanced existing logic in slot management to improve the robustness of path evaluations and user feedback.
- Added tests for the new off-topic slot gap specification to ensure functionality and correctness.

Enhance Progression Findings and Graph Editor with Evaluation Staleness Handling

Deploy Development / deploy (push) Successful in 40s

Details

Test Suite / pytest-backend (push) Successful in 47s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 13s

Details

Test Suite / k6 /health Baseline (push) Failing after 2s

Details

Test Suite / playwright-tests (push) Successful in 1m19s

Details

5e5f4ca8d4

- Added `evaluationStale` state to `ProgressionGraphEditor` and `ProgressionFindingsPanel` to track the freshness of evaluations.
- Updated UI to display a warning when evaluations are stale, prompting users to re-evaluate the graph.
- Modified loading and evaluation functions to manage the `evaluationStale` state effectively, ensuring accurate user feedback during the evaluation process.
- Improved user notifications regarding the need for re-evaluation after changes to the graph.

Add findings_stale field to GraphPlanningRoadmapArtifact and update ProgressionGraphEditor for state management

Deploy Development / deploy (push) Successful in 43s

Details

Test Suite / pytest-backend (push) Successful in 45s

Details

Test Suite / lint-backend (push) Successful in 0s

Details

Test Suite / build-frontend (push) Successful in 14s

Details

Test Suite / k6 /health Baseline (push) Successful in 34s

Details

Test Suite / playwright-tests (push) Successful in 1m34s

Details

7265cd5a01

- Introduced `findings_stale` field in `GraphPlanningRoadmapArtifact` to track the freshness of findings.
- Updated `ProgressionGraphEditor` to manage `findingsStale` state across various functions, ensuring accurate representation of evaluation status.
- Modified related utility functions and tests to accommodate the new state, enhancing overall functionality and user feedback in the progression graph management process.

Lars merged commit ea7de64061 into main

2026-06-13 16:34:09 +02:00

Lars referenced this issue from a commit

2026-06-13 16:34:11 +02:00

Merge pull request 'progression V2' (#57) from develop into main

Sign in to join this conversation.

No reviewers

No Label

No Milestone

No project

No Assignees

1 Participants

Notifications

Due Date

The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: Lars/shinkan-jinkendo#57