TEXTRACT: Anayze Table of Contents

0

Using Textract for a table of contents where each line has** TITLE . . . . Author PageNo.** Resultant table has Title and Author merges ignoring dot-leader as one column and page numbers has 2nd column. How can I get Textract to treat dot-leader as a column separator

已提问 2 个月前170 查看次数
2 回答
0

Could you provide sample image for better understanding?

AWS
已回答 2 个月前
0

Would it be feasible to process text before textract ? So you could insert some kind of well known separator to be easily recognized by the ML behind it.

profile picture
专家
已回答 2 个月前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则