TEXTRACT: Anayze Table of Contents

0

Using Textract for a table of contents where each line has** TITLE . . . . Author PageNo.** Resultant table has Title and Author merges ignoring dot-leader as one column and page numbers has 2nd column. How can I get Textract to treat dot-leader as a column separator

질문됨 2달 전170회 조회
2개 답변
0

Could you provide sample image for better understanding?

AWS
답변함 2달 전
0

Would it be feasible to process text before textract ? So you could insert some kind of well known separator to be easily recognized by the ML behind it.

profile picture
전문가
답변함 2달 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠