Complex Table/Cells #360
wolfassi123
started this conversation in
General
Replies: 2 comments
-
Good question, I'd be interested too. |
Beta Was this translation helpful? Give feedback.
0 replies
-
Me with my team (I'm not connected to author of deepdoctection) are trying to improve table detection and structure recognition by using different methods we are still looking for table detection but for structure recognition we are trying to include UniTable into dd pipeline |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I was wondering if any of you managed to solve some of the issues that we face when dealing with a complex table in a pdf. For example several subheaders, cells that span several columns or rows (they can span several columns or rows, but their text would span only through 1 cell/row)
Was trying to find a solution to complex table using several libraries but so far always came a bit too short.
The library does pretty good at simplifying some of the basic tasks, but still a complex table seems to have no fix yet.
Below is an example table that I made on Excel inspired by a real case scenario that I have faced. Is there a way to fix the fact that some cells/headers span several columns for example. The only detection we have that would allow us to know for how many columns a header span is basically the horizontal line at the bottom of the header.
Beta Was this translation helpful? Give feedback.
All reactions