Reactive Machines

Securing Amazon Bedrock cross-Region inference: Geographic and global

Securing Amazon Bedrock cross-Region inference: Geographic and global

The adoption and implementation of generative AI inference has increased with organizations building more operational workloads that use AI capabilities…
Getting Started with XGBoost: A Beginner-Friendly Tutorial

Getting Started with XGBoost: A Beginner-Friendly Tutorial

Among all the tools that a data scientist has, it is difficult to find one that has received a reputation…
How Omada Health scaled patient care by fine-tuning Llama models on Amazon SageMaker AI

How Omada Health scaled patient care by fine-tuning Llama models on Amazon SageMaker AI

This post is co-written with Sunaina Kavi, AI/ML Product Manager at Omada Health. Omada Health, a longtime innovator in virtual…
DeepMMSearch-R1: Enabling Multimodal LLMs in Multimodal Web Search

DeepMMSearch-R1: Enabling Multimodal LLMs in Multimodal Web Search

Multimodal Large Language Models (MLLMs) in real-world applications need access to external information sources and must constantly respond to dynamic…
Multivariate Conformal Prediction using Optimal Transport

Multivariate Conformal Prediction using Optimal Transport

Conformal prediction (CP) quantifies the uncertainty of machine learning models by constructing sets of observable outputs. These sets are constructed…
MANZANO: A Simple and Sharp Multimodal Integrated Model with Hybrid Vision Tokenizer

MANZANO: A Simple and Sharp Multimodal Integrated Model with Hybrid Vision Tokenizer

Large-scale Linguistic Models (LLMs) capable of understanding and generating visual content hold great potential. However, existing open source models often…
Crossmodal search with Amazon Nova Multimodal Embeddings

Crossmodal search with Amazon Nova Multimodal Embeddings

Amazon Nova Multimodal Embeddings processes text, documents, images, video, and audio through a single model architecture. Available through Amazon Bedrock,…
Accelerating LLM inference with post-training weight and activation using AWQ and GPTQ on Amazon SageMaker AI

Accelerating LLM inference with post-training weight and activation using AWQ and GPTQ on Amazon SageMaker AI

Foundation models (FMs) and large language models (LLMs) have been rapidly scaling, often doubling in parameter count within months, leading…
Sentiment Analysis with Text and Audio Using AWS Generative AI Services: Approaches, Challenges, and Solutions

Sentiment Analysis with Text and Audio Using AWS Generative AI Services: Approaches, Challenges, and Solutions

This post is co-written by Instituto de Ciência e Tecnologia Itaú (ICTi) and AWS. Sentiment analysis has grown increasingly important…
Which Tests Which Model? A Taxonomy of Discourse Model Testing

Which Tests Which Model? A Taxonomy of Discourse Model Testing

Speech-based models have recently gained incredible capabilities across many tasks. However, their evaluation remains inconsistent across all types of tasks…
Back to top button