Bedrock model for BPMN (process workflow) search and summarization

0

Hello Team, The use case is to build an interface to allow semantic search and summarization capability based on a bucket full of BPMN xml files kept in S3 (nearly 100 BPMN xml files in total) . Instead of BPMN process flow xml library/files, alternate document library could be pdf of process flow diagrams - based on whatever works, either xml OR pdf diagrams can be arranged and stored in S3. The interface should be able to summarize one single process flow from the S3 path of 1 file as well.

First step is to build a PoC in AWS platform for this and then take it forward for higher environments. Considering RAG with OpenSearch and SageMaker for the solution: https://github.com/aws-samples/rag-with-amazon-opensearch-and-sagemaker/blob/main/cdk_stacks/README.md

I looked at this following web page for process/BPMN model to start with, but could NOT find any suitable: https://aws.amazon.com/bedrock/

What model is next best suited to start building the solution on AWS platform?

Thanks in advance for your guidance.

asked a month ago152 views
1 Answer
1

A promising approach is to leverage AWS services like Amazon Textract for extracting text, Amazon Comprehend for natural language processing tasks like summarization, and Amazon Kendra for intelligent semantic search. This solution can be complemented with AWS Lambda and API Gateway for building the interface, and optionally, Amazon SageMaker for advanced custom models. Alternatively, open-source NLP libraries like Hugging Face Transformers or spaCy can be deployed on AWS services like SageMaker or ECS.

profile picture
EXPERT
answered a month ago
  • Can Textract or Comprehend work on BPMN process workflow xml or process diagram pdf ?

  • I can tell you that these services can help extract and analyze text from BPMN process workflow files, but they may not be able to fully understand the semantics and structure of the BPMN notation or process diagrams. I am searching about how the best way to process BPMN process workflow. (Amazon Textract primarily worked with image-based files)

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions