HTML metadata extraction (OG, JSON-LD, meta tags) #1

Open
opened 2026-06-14 18:00:04 +00:00 by glow · 0 comments
Owner

Parse HTML metadata from Firecrawl output into structured fields.

Priority: OG > JSON-LD > meta tags > title element.
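
As an illustration of that priority order, here is a minimal sketch using only the Python standard library. It is not the code in extractor/metadata.py: the names `_MetadataCollector` and `extract_metadata`, the output fields (`title`, `description`), and the JSON-LD keys consulted (`headline`, `name`, `description`) are all assumptions made for the example.

```python
import json
from html.parser import HTMLParser


class _MetadataCollector(HTMLParser):
    """Hypothetical helper: gathers raw values from each metadata source."""

    def __init__(self):
        super().__init__()
        self.og = {}        # <meta property="og:...">
        self.jsonld = {}    # <script type="application/ld+json"> (top-level object only)
        self.meta = {}      # <meta name="...">
        self.title = ""     # <title> text
        self._in_title = False
        self._in_jsonld = False
        self._buf = []

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "meta":
            content = a.get("content")
            if content is None:
                return
            prop = a.get("property") or ""
            name = a.get("name") or ""
            if prop.startswith("og:"):
                self.og.setdefault(prop[3:], content)
            elif name:
                self.meta.setdefault(name.lower(), content)
        elif tag == "title":
            self._in_title = True
        elif tag == "script" and (a.get("type") or "").lower() == "application/ld+json":
            self._in_jsonld = True
            self._buf = []

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif self._in_jsonld:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                doc = json.loads("".join(self._buf))
            except json.JSONDecodeError:
                return  # malformed JSON-LD is skipped, lower-priority sources still apply
            if isinstance(doc, dict):
                for key, value in doc.items():
                    self.jsonld.setdefault(key, value)


def _first(*candidates):
    """Return the first non-empty string among candidates, else None."""
    for value in candidates:
        if isinstance(value, str) and value.strip():
            return value.strip()
    return None


def extract_metadata(html):
    """Resolve each field in the order OG > JSON-LD > meta tags > title element."""
    c = _MetadataCollector()
    c.feed(html)
    c.close()
    return {
        "title": _first(
            c.og.get("title"),
            c.jsonld.get("headline"),
            c.jsonld.get("name"),
            c.meta.get("title"),
            c.title,
        ),
        "description": _first(
            c.og.get("description"),
            c.jsonld.get("description"),
            c.meta.get("description"),
        ),
    }
```

Each source is collected independently and the fallback is applied per field, so a page with an OG title but no OG description can still take its description from JSON-LD or a plain meta tag.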

Files:

  • extractor/metadata.py
  • tests/test_metadata.py (15 tests)

Status: implemented and tested

Reference
glow-all/sibyl-extractor#1