HTML Parser Completeness – Full Spec Tokenizer + Tree Construction Edge Cases #126
glow-all/true-headless-browser#126
Gap
Our HTML5 parser (src/dom/parser.ts + src/html/HTMLParser.ts) covers most cases. However, there are spec gaps that produce incorrect DOM trees on unusual pages.
What's missing
1. Tokenizer gaps
2. Tree construction gaps
3. Serialization
innerHTML getter: HTML serialization (void elements, self-closing, attribute quoting)
outerHTML getter: like innerHTML, but including the element itself
4. Encoding detection
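For the serialization points in item 3, here is a minimal sketch of how the innerHTML and outerHTML getters could handle void elements and attribute quoting. The node types and the function names (`serializeChildren`, `serializeElement`, `escapeString`) are hypothetical and do not reflect the actual classes in src/dom. The sketch ignores comments, doctypes, namespaces and `<template>` contents, and the exact escaping rules should be checked against the current HTML Standard text.

```typescript
// Hypothetical minimal node shapes for illustration only;
// the real types in src/dom will differ.
type TextNode = { kind: "text"; data: string };
type ElementNode = {
  kind: "element";
  tagName: string; // assumed to be lowercase already
  attributes: Array<[string, string]>; // [name, value] pairs
  children: SimpleNode[];
};
type SimpleNode = TextNode | ElementNode;

// Void elements plus a few obsolete elements that the HTML Standard
// also serializes without children or an end tag.
const SERIALIZES_AS_VOID = new Set([
  "area", "base", "br", "col", "embed", "hr", "img", "input",
  "link", "meta", "source", "track", "wbr",
  "basefont", "bgsound", "frame", "keygen", "param",
]);

// Text inside these parents is emitted literally, without escaping.
// (noscript belongs here too when scripting is enabled; omitted in this sketch.)
const RAW_TEXT_PARENTS = new Set([
  "style", "script", "xmp", "iframe", "noembed", "noframes", "plaintext",
]);

// Escape a string for text content or for a double-quoted attribute value.
// Assumption: attribute mode escapes only &, U+00A0 and "; verify against
// the current spec whether < and > must be escaped there as well.
function escapeString(s: string, attributeMode: boolean): string {
  const out = s.replace(/&/g, "&amp;").replace(/\u00A0/g, "&nbsp;");
  return attributeMode
    ? out.replace(/"/g, "&quot;")
    : out.replace(/</g, "&lt;").replace(/>/g, "&gt;");
}

// innerHTML-style serialization: the children only.
function serializeChildren(parent: ElementNode): string {
  if (SERIALIZES_AS_VOID.has(parent.tagName)) return "";
  let out = "";
  for (const child of parent.children) {
    if (child.kind === "text") {
      out += RAW_TEXT_PARENTS.has(parent.tagName)
        ? child.data
        : escapeString(child.data, false);
    } else {
      out += serializeElement(child);
    }
  }
  return out;
}

// outerHTML-style serialization: the element itself plus its children.
function serializeElement(el: ElementNode): string {
  let out = "<" + el.tagName;
  for (const [name, value] of el.attributes) {
    // Attribute values are always double-quoted in the output.
    out += ` ${name}="${escapeString(value, true)}"`;
  }
  out += ">";
  // No "/>" and no end tag for void elements.
  if (SERIALIZES_AS_VOID.has(el.tagName)) return out;
  return out + serializeChildren(el) + "</" + el.tagName + ">";
}

// Usage: a <p> containing text and an <img> with a tricky attribute value.
const exampleImg: ElementNode = {
  kind: "element", tagName: "img",
  attributes: [["alt", 'a "b" & c']], children: [],
};
const exampleP: ElementNode = {
  kind: "element", tagName: "p", attributes: [],
  children: [{ kind: "text", data: "1 < 2" }, exampleImg],
};
console.log(serializeElement(exampleP));
// prints: <p>1 &lt; 2<img alt="a &quot;b&quot; &amp; c"></p>
```

Keeping the outerHTML path as a thin wrapper around the innerHTML path means both getters share one escaping and void-element policy, which is what the second bullet in item 3 asks for.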