TEXTRACT: Anayze Table of Contents

0

Using Textract for a table of contents where each line has** TITLE . . . . Author PageNo.** Resultant table has Title and Author merges ignoring dot-leader as one column and page numbers has 2nd column. How can I get Textract to treat dot-leader as a column separator

已提問 2 個月前檢視次數 170 次
2 個答案
0

Could you provide sample image for better understanding?

AWS
已回答 2 個月前
0

Would it be feasible to process text before textract ? So you could insert some kind of well known separator to be easily recognized by the ML behind it.

profile picture
專家
已回答 2 個月前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南