Generative AI

January 26, 2026

What is Cloudbot? How the First Local Agent Stack Turns Conversations into True Automation

Cloudbot is an open source AI personal assistant that you run on your hardware. It connects large language models from…

January 25, 2026

StepFun AI Introduces Step-Deep Research: A Cost-Efficient Deep Research Agent Model Built on Atomic Energy

StepFun has launched Step-DeepResearch, the only 32B parameter to complete deep research agent that aims to transform web search into…

January 24, 2026

How Machine Learning and Semantic Embedding Reframe CVE Vulnerability Beyond CVSS Raw Scores

def visualize_results(df, priority_scores, feature_importance): fig, axes = plt.subplots(2, 3, figsize=(18, 10)) fig.suptitle('Vulnerability Scanner - ML Analysis Dashboard', fontsize=16, fontweight="bold") axes[0,…

January 23, 2026

GitHub Releases Copilot-SDK to Embed its Agenttic Timeline in Any Application

GitHub has released an internal agent runtime that powers the GitHub Copilot CLI and exposed it as a programmable SDK.…

January 23, 2026

How Does an AI Agent Choose What to Do Under Tokens, Latency, and Budget Constraints?

In this tutorial, we build a scheduling agent that intentionally monitors output quality against real-world constraints such as token usage,…

January 23, 2026

Qwen Researchers Release Qwen3-TTS: A Multilingual TTS Suite With Real-Time Latency and Smart Voice Control

Alibaba Cloud's Qwen team has open-sourced Qwen3-TTS, a family of multilingual text-to-speech models that target three main functions in a…

January 22, 2026

Microsoft Releases VibeVoice-ASR: An Integrated Speech-to-Text Model Designed to Capture 60-Minute Long-Form Audio in a Single Pass.

Microsoft released VibeVoice-ASR as part of the VibeVoice family of open source voice AI models. VibeVoice-ASR is described as an…

January 22, 2026

FlashLabs Researchers Release Chroma 1.0: A 4B Real-Time Chat Model With Personalized Voice Cloning

Chroma 1.0 is a real-time speech-to-conversation model that takes sound as input and returns sound as output while preserving speaker…

January 21, 2026

Inworld AI Releases TTS-1.5 Real-Time, Production-Grade Voice Agent

Inworld AI has introduced the Inworld TTS-1.5, an upgrade to its TTS-1 family targeting real-time voice agents with strict constraints…

January 21, 2026

Salesforce AI Introduces FOFPred: A Language-Driven Future Flow Prediction Framework That Enables Advanced Robot Control and Video Production

The Salesforce AI research team introduces FOFPred, a language-driven visual flow prediction framework that connects large-scale visual language models and…

Previous page Next page

Generative AI

What is Cloudbot? How the First Local Agent Stack Turns Conversations into True Automation

StepFun AI Introduces Step-Deep Research: A Cost-Efficient Deep Research Agent Model Built on Atomic Energy

How Machine Learning and Semantic Embedding Reframe CVE Vulnerability Beyond CVSS Raw Scores

GitHub Releases Copilot-SDK to Embed its Agenttic Timeline in Any Application

How Does an AI Agent Choose What to Do Under Tokens, Latency, and Budget Constraints?

Qwen Researchers Release Qwen3-TTS: A Multilingual TTS Suite With Real-Time Latency and Smart Voice Control

Microsoft Releases VibeVoice-ASR: An Integrated Speech-to-Text Model Designed to Capture 60-Minute Long-Form Audio in a Single Pass.

FlashLabs Researchers Release Chroma 1.0: A 4B Real-Time Chat Model With Personalized Voice Cloning

Inworld AI Releases TTS-1.5 Real-Time, Production-Grade Voice Agent

Salesforce AI Introduces FOFPred: A Language-Driven Future Flow Prediction Framework That Enables Advanced Robot Control and Video Production

Subscribers, Revenue, Market Share & Global Reach

5-return back to the base

Gemma 3 270m: Model of a hyper-effective compact of AI

Merlyn's Advice on What to Do When the World Gets You Down – The Marginalian

Cut researchers present the work that calls llms: Eliminating SQL relief to improve the accuracy of information and efficiency

OASIS: Simuleringar av social interaction mellan en miljon agent

FALCON 3 models are now available at Amazon Sagemaker Jumpstart

This AI paper introduces codesters: Physical models are symbolic language with code / guide

Meta SAM 2.1 is now available in Amazon SageMaker JumpStart