Replies: 1 comment
-
I think you need to either use a rule-based approach (e.g. detecting bullet points) or train a model that detects list items. Note that DocLayNet annotates lists differently, though: there, what is labelled as a list is actually an individual list item. One idea is to train a model on DocLayNet (or find one already trained on it) and use its list output to divide your list into items. I am experimenting with list item detection (especially for nested lists) as well. My approach is simply to get some samples labelled and see how well it works.
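To make the rule-based option a bit more concrete, here is a minimal sketch in plain Python. It assumes you have already grouped the words inside a list into text lines in reading order; that grouping step is not shown and is not something the library is known to provide for this purpose. `MARKER` and `split_list_items` are hypothetical names made up for this example, and the marker pattern is only a heuristic (a continuation line that happens to start with something like "A. " would be misread as a new item).

```python
import re

# Heuristic pattern for common list markers at the start of a line:
# bullets (-, *, bullet glyphs, en dash), "1." / "1)" / "(1)", "a." / "a)" / "(a)".
MARKER = re.compile(
    r"^\s*(?:[-*\u2022\u25E6\u25AA\u2013]|\(?\d{1,3}[.)]|\(?[a-zA-Z][.)])\s+"
)


def split_list_items(lines):
    """Group text lines into list items.

    A line that starts with a marker opens a new item; any other
    non-empty line is treated as a continuation of the current item.
    """
    items = []
    for line in lines:
        text = line.strip()
        if not text:
            continue  # skip blank lines
        if MARKER.match(line) or not items:
            items.append(text)
        else:
            items[-1] += " " + text
    return items


if __name__ == "__main__":
    sample = [
        "\u2022 first item",
        "  continues here",
        "\u2022 second item",
        "1. third item",
    ]
    for item in split_list_items(sample):
        print(item)
```

For nested lists, one possible extension would be to compare the left x-coordinate of each marker to infer the nesting level, but that depends on having reliable word bounding boxes.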
-
What are our options for detecting list items within a list? DDT finds lists very reliably on default settings, but all the children of LayoutType.LIST seem to be LayoutType.WORD annotations. I looked at the other available models (print_model_infos) and at the source code, and didn't see anything indicating that list item detection is possible, but I wanted to ask just in case.
Thank you for making this!