Generative AI

NVIDIA garak Tutorial: Build a Full Red Team Workflow for LLM Security with Custom Probes and Detectors

print("\n########## 5. ANALYSIS ##########")
import numpy as np, pandas as pd
def find_latest_report():
    cands = []
    for base in [os.path.expanduser("~/.local/share/garak/garak_runs"),…
Moonshot AI Releases Kimi Code CLI: Terminal AI Coding Agent Built with TypeScript for Next-Gen Agents

Moonshot AI has released the Kimi Code CLI, an open source terminal-based agent. The tool reads and edits code, executes…
NVIDIA Releases Nemotron 3.5 ASR: A 600M-Parameter Cache-Aware Streaming Model That Transcribes 40 Language Locales in Real Time

The NVIDIA Nemotron Speech team has released Nemotron 3.5 ASR. It is a 600M-parameter streaming Automatic Speech Recognition (ASR) model. A single checkpoint transcribes…
Google DeepMind Releases Gemma 4 QAT Checkpoints: Q4_0 and Mobile Format Cut On-Device Memory

Google DeepMind has released Quantization-Aware Training (QAT) checkpoints for the Gemma 4 family. The release targets local use on…
NVIDIA AI Releases Dynamo Snapshot: A CRIU-Based Fast Startup System for AI Inference on Kubernetes

In production inference deployments, demand fluctuates over time, requiring inference replicas to scale elastically. Cold-starting inference workloads on Kubernetes can…
Building a Semantic Search Engine and Open State Classifier on ResearchMath-14k Dataset

from sentence_transformers import util
def search(query, k=5):
    q = model.encode([query], normalize_embeddings=True)
    sims = util.cos_sim(q, emb)[0].cpu().numpy()
    idx = sims.argsort()[::-1][:k]
    print(f'\n=== Query:…
Miso Labs Releases MisoTTS: An 8B Dynamic Model for Open-Weighted Text-to-Speech

Miso Labs has released MisoTTS, an open-source 8-billion-parameter text-to-speech model. It produces expressive speech in both text and audio contexts. The model…