Pull requests: Unstructured-IO/unstructured

Pull requests list

Fix parallel PDF API script env handling
#4385 opened Jul 2, 2026 by compair-steven
feat(pdf): flag invisible text in metadata
#4383 opened Jun 30, 2026 by Success6666
feat: add max_page to chunk_by_title and remove multipage_sections
#4382 opened Jun 30, 2026 by issahammoud
5 tasks done
fix: preserve table row boundaries
#4376 opened Jun 24, 2026 by 2830500285
fix: derive crop box from coordinate extent in save_elements
#4371 opened Jun 9, 2026 by badGarnet Collaborator
Optimize XLSX subtable detection memory usage
#4357 opened Jun 1, 2026 by CyMule Contributor Draft
feat: expose layout confidence metadata
#4356 opened May 26, 2026 by RitwijParmar
Fix br tail text handling in HTML tables
#4351 opened May 12, 2026 by dsolankii
fix: avoid false-positive Title classification for long no-space text
#4348 opened Apr 28, 2026 by claytonlin1110 Contributor
1 of 4 tasks
feat: add AG2 multi-agent document processing example
#4326 opened Apr 7, 2026 by faridun-ag2
7 tasks done
feat: infer hierarchical heading levels (H1-H6) for PDFs (#4204)
#4325 opened Apr 7, 2026 by statxc
2 tasks done
feat: add support python3.14
#4312 opened Apr 1, 2026 by FomalhautWeisszwerg