Embed Amazon Quick Suite chat agents in business applications

Organizations can face two key challenges with conversational AI. First, users need answers where they work—in their CRM, support console, or analytics portal—not in separate tools. Second, implementing a secure conversation centered on their systems would require weeks of development to build authentication, token verification, domain security, and global distribution infrastructure.
Amazon Quick Suite's embedded chat helps solve the first challenge by bringing conversational AI directly into your apps, so users can query structured data, search documents, and trigger actions without switching tools.
In this post, we show you how to solve the second challenge with a one-click deployment solution for embedding chat agents using the Quick Suite Embedding SDK on business sites.
Solution overview
The solution uses a secure web portal for embedded chat using Amazon CloudFront for global content delivery, Amazon Cognito for OAuth 2.0 authentication, Amazon API Gateway for REST API endpoints, AWS Lambda for serverless API processing, and the OpenID Connect (OIDC) organization for identity integration with Quick Suite.
The solution uses deep protection with multiple layers of protection: DDoS protection in CloudFront, a private Amazon Simple Storage Service (Amazon S3) private bucket with original access control helps prevent direct access to front-end assets, AWS WAF level of limit protection in API Gateway, and JSON Web Token (JWT) signature verification using limited Amazon Cognic generic keys. with less privileged AWS Identity and Access Management (IAM) permissions.
The following diagram shows the structure of the solution.
The workflow consists of the following steps:
- Users access a web portal URL, which redirects to CloudFront.
- CloudFront uses native access control to download HTML, CSS, and JavaScript files from a private S3 bucket.
- The web app checks for a valid authentication token and redirects unauthorized users to an Amazon Cognito managed UI for OAuth 2.0 login.
- Users enter information on an Amazon Cognito login page, which authenticates them and redirects back to the CloudFront URL with a one-time authorization code.
- The application generates an authorization code and makes an HTTPS API call to the API Gateway, which bypasses the AWS WAF rate limit.
- API Gateway invokes the Lambda function with the authorization code.
- The Lambda function makes a server-to-server HTTPS call to an Amazon Cognito OAuth token repository, exchanging an authorization code for JWT tokens (ID token, access token, refresh token).
- The function verifies the cryptographic signature of the ID token using the Amazon Cognito public keys JSON Web Key Set (JWKS) with thread-safe caching.
The following is an example JWT done in code:
- The Lambda function calls the AWS Security Token Service (AWS STS) AssumeRoleWithWebIdentity API with an authenticated ID token to assume the IAM web identity role and accept temporary AWS credentials.
- The function uses temporary information to call the Quick Suite ListUsers API to verify that the user exists, and then calls the GenerateEmbedUrlForRegisteredUser API to help generate a secure embedded URL with domain restrictions.
- The function returns an embedded URL in a JSON response with cross-resource sharing (CORS) headers through API Gateway to CloudFront. The following is an example of an embed URL:
- The CloudFront application uses the Quick Suite Embedding SDK to create an embedding context and provide a conversational interface through an HTML iframe with secure connections from other locations.
You can deploy a solution with the following high-level steps:
- Deploy a serverless infrastructure using the AWS Cloud Development Kit (AWS CDK).
- Assign users to Amazon Cognito and Quick Suite.
- Share Quick Suite assets (chat agent and related communications, knowledge base).
- Access the web portal to use Quick Suite chat agents.
What is required
The following requirements are required to use the solution shown in this post:
Deploy a serverless infrastructure using the AWS CDK
Complete the following steps to deploy a serverless infrastructure using the AWS CDK:
- Clone a GitHub repository:
- Deploy infrastructure:
You will be asked to enter your AWS Region code, AWS CloudFormation stack ID and portal title, and your AWS CLI profile.



Assign users to Amazon Cognito and Quick Suite
Complete the following steps to provision users in Amazon Cognito and Quick Suite:
- Create an Amazon Cognito user in the Amazon Cognito user pool:

- Create an associate user in Quick Suite:

Share the Quick Suite chat agent
Complete the following steps to share your Quick Suite chat agent:
- Log in to the Quick Suite console using credentials for the Quick Suite Author Pro role.
- Select Negotiation agents in the navigation pane.
- Select the agents you want to share (for example, AnyCompany Ecom order assistant) and select Share it.

- Search for the username (for example, [email protected]) that you created earlier.
- Select Share it.


After sharing this agent, you also need to share each connected resource of the agent separately to ensure full functionality.
Access the web portal to use Quick Suite chat agents
Complete the following steps to access the web portal and start using chat agents:
- See the temporary password in the Amazon Cognito confirmation email.
- Access the CloudFront URL in your web browser with a temporary user ID and password.
- You will be prompted to change your password on your first login.
After successful login, you can see My Helper in the dialog interface.
- Select a Region to connect to custom Quick Suite chat agents.

- To see chat agents shared with you, select It was shared with me below Filter.

- Select the agent you want and start chatting.

The following screenshots show the chat interaction of a customer service representative tracking an instance of an online order and processing its refund as requested by a verified customer over the phone.




Clean up
To clean up your resources, remove used AWS resources:
The conclusion
This solution addresses the main challenges of embedding conversational AI on a large scale: obtaining the authentication of thousands of users simultaneously in all regions of the world, maintaining enterprise-level security with extensive audit trails, and simplifying deployment through automated infrastructure provisioning. You can customize portal branding, configure security policies, and integrate with existing identity providers. You can scale to thousands of users at once automatically while keeping payment rates as you go.
To try this solution, match the GitHub repository and use the complete one-click infrastructure to embed Quick Suite chat agents.
About the writers
Satyanarayana Adimula is a Senior Architect at AWS Generative AI Innovation & Delivery. Drawing on more than 20 years of data knowledge and analytics, he specializes in building agent AI systems that enable large enterprises to automate complex workflows, accelerate decision-making, and achieve measurable business results.


