|
|
ee91583614
|
Update graph_derive_edges.py to version 4.3.1: Introduce precision prioritization for chunk scope, ensuring chunk candidates are favored over note scope. Adjust confidence values for explicit callouts and enhance key generation for consistent deduplication. Improve edge processing logic to reinforce the precedence of chunk scope in decision-making.
|
2026-01-11 15:08:08 +01:00 |
|
|
|
3a17b646e1
|
Update graph_derive_edges.py and ingestion_chunk_payload.py for version 4.3.0: Introduce debug logging for data transfer audits and candidate pool handling to address potential data loss. Ensure candidate_pool is explicitly retained for accurate chunk attribution, enhancing traceability and reliability in edge processing.
|
2026-01-11 14:51:38 +01:00 |
|
|
|
727de50290
|
Refine edge parsing and chunk attribution in chunking_parser.py and graph_derive_edges.py for version 4.2.9: Ensure current_edge_type persists across empty lines in callout blocks for accurate link processing. Implement two-phase synchronization for chunk authority, collecting explicit callout keys before the global scan to prevent duplicates. Enhance callout extraction logic to respect existing chunk callouts, improving deduplication and processing efficiency.
|
2026-01-11 14:30:16 +01:00 |
|
|
|
a780104b3c
|
Enhance edge processing in graph_derive_edges.py for version 4.2.9: Finalize chunk attribution with synchronization to "Semantic First" signal. Collect callout keys from candidate pool before text scan to prevent duplicates. Update callout extraction logic to ensure strict adherence to existing chunk callouts, improving deduplication and processing efficiency.
|
2026-01-11 14:07:16 +01:00 |
|
|
|
55b64c331a
|
Enhance chunking system with WP-24c v4.2.6 and v4.2.7 updates: Introduce is_meta_content flag for callouts in RawBlock, ensuring they are chunked but later removed for clean context. Update parse_blocks and propagate_section_edges to handle callout edges with explicit provenance for chunk attribution. Implement clean-context logic to remove callout syntax post-processing, maintaining chunk integrity. Adjust get_chunk_config to prioritize frontmatter overrides for chunking profiles. Update documentation to reflect these changes.
|
2026-01-11 11:14:31 +01:00 |
|
|
|
6131b315d7
|
Update graph_derive_edges.py to version 4.2.2: Implement semantic de-duplication with improved scope decision-making. Enhance edge ID calculation by prioritizing semantic grouping before scope assignment, ensuring accurate edge representation across different contexts. Update documentation to reflect changes in edge processing logic and prioritization strategy.
|
2026-01-10 22:20:13 +01:00 |
|
|
|
dfff46e45c
|
Update graph_derive_edges.py to version 4.2.1: Implement Clean-Context enhancements, including consolidated callout extraction and smart scope prioritization. Refactor callout handling to avoid duplicates and improve processing efficiency. Update documentation to reflect changes in edge extraction logic and prioritization strategy.
|
2026-01-10 22:17:03 +01:00 |
|
|
|
003a270548
|
Implement WP-24c v4.2.0: Introduce configurable header names and levels for LLM validation and Note-Scope zones in the chunking system. Update chunking models, parser, and processor to support exclusion of edge zones during chunking. Enhance documentation and configuration files to reflect new environment variables for improved flexibility in Markdown processing.
|
2026-01-10 21:46:51 +01:00 |
|
|
|
39fd15b565
|
Update graph_db_adapter.py, graph_derive_edges.py, graph_subgraph.py, graph_utils.py, ingestion_processor.py, and retriever.py to version 4.1.0: Introduce Scope-Awareness and Section-Filtering features, enhancing edge retrieval and processing. Implement Note-Scope Zones extraction from Markdown, improve edge ID generation with target_section, and prioritize Note-Scope Links during de-duplication. Update documentation for clarity and consistency across modules.
|
2026-01-10 19:55:51 +01:00 |
|
|
|
2da98e8e37
|
Update graph_derive_edges.py and graph_utils.py to version 4.1.0: Enhance edge ID generation by incorporating target_section into the ID calculation, allowing for distinct edges across different sections. Update documentation to reflect changes in ID structure and improve clarity on edge handling during de-duplication.
|
2026-01-10 15:45:26 +01:00 |
|
|
|
a852975811
|
Update qdrant_points.py, graph_utils.py, graph_derive_edges.py, and ingestion_processor.py to version 4.0.0: Implement GOLD-STANDARD identity with strict 4-parameter ID generation, eliminating rule_id and variant from ID calculations. Enhance documentation for clarity and consistency across modules, addressing ID drift and ensuring compatibility in the ingestion workflow.
|
2026-01-10 15:19:46 +01:00 |
|
|
|
d35bdc64b9
|
Implement WP-15c enhancements across graph and retrieval modules, including full metadata support for Super-Edge aggregation and Note-Level Diversity Pooling. Update scoring logic to reflect new edge handling and improve retrieval accuracy. Version updates to reflect these changes.
|
2025-12-30 21:47:18 +01:00 |
|
|
|
b7d1bcce3d
|
Rücksprung zur Vorwersion, in der 2 Kantentypen angelegt wurden
|
2025-12-29 18:04:14 +01:00 |
|
|
|
03d3173ca6
|
neu deduplizierung für callout-edges
|
2025-12-29 12:42:26 +01:00 |
|
|
|
38a61d7b50
|
Fix: Semantische Deduplizierung in graph_derive_edges.py
|
2025-12-29 12:21:57 +01:00 |
|
|
|
0a429e1f7b
|
anpassungen Kantenvergeleich
|
2025-12-29 11:45:25 +01:00 |
|
|
|
feeb7c2d92
|
Initial WP4d
|
2025-12-29 07:58:20 +01:00 |
|
|
|
19c96fd00f
|
graph refacturiert
|
2025-12-27 14:44:44 +01:00 |
|