Pipeline orchestrator #5

Open
opened 2026-06-14 18:00:04 +00:00 by glow · 0 comments
Owner

ExtractionPipeline chains all modules.

Flow: Firecrawl HTML -> Metadata -> Fields -> Quality -> Dedup -> struct

Files:

  • extractor/pipeline.py, types.py, init.py

Tests: 66/66 passing

Status: implemented and tested

ExtractionPipeline chains all modules. Flow: Firecrawl HTML -> Metadata -> Fields -> Quality -> Dedup -> struct Files: - extractor/pipeline.py, types.py, __init__.py Tests: 66/66 passing Status: implemented and tested
Sign in to join this conversation.
No labels
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
glow-all/sibyl-extractor#5
No description provided.