Pull requests: Unstructured-IO/unstructured
Pull requests list
fix: handle list output from group_bullet_paragraph in element apply()
#4253
opened Feb 21, 2026 by
s0wa48
fix: accept any IO[bytes] object in convert_to_bytes()
#4241
opened Feb 16, 2026 by
bittoby
fix: coerce None text to empty string in Text element
#4231
opened Feb 10, 2026 by
themavik
feat: add XLSM (Excel Macro-Enabled Workbook) parsing support
#4227
opened Feb 8, 2026 by
longway-code
docs: fix redundant whitespace in pyenv command in README
#4224
opened Feb 3, 2026 by
longway-code
fix(deps): Update docker.elastic.co/elasticsearch/elasticsearch Docker tag to v8.19.11
dependencies
security
#4223
opened Feb 3, 2026 by
utic-renovate
bot
1 task
feat: Infer hierarchical heading levels (H1-H4) for PDFs
#4222
opened Feb 2, 2026 by
Angel98518
Fix FutureWarning: Add test to verify bytes are wrapped in BytesIO for read_excel
#4213
opened Jan 27, 2026 by
Angel98518
⚡️ Speed up function merge_out_layout_with_ocr_layout by 30%
#4212
opened Jan 27, 2026 by
aseembits93
feat: chunking by character and title now isolates tables
#4197
opened Jan 15, 2026 by
badGarnet
fix: NameError: LayoutElements not defined in paddle_ocr.py
#4195
opened Jan 15, 2026 by
mohansinghi
fix: None text attribute when normalizing Picture to Image element
#4083
opened Aug 22, 2025 by
ishahroz
Switch from pdfminer to paves to improve robustness and use multiple CPUs
#4067
opened Jul 19, 2025 by
dhdaines