If you're building live voice or video chat with Gemini, this skill covers the WebSocket-based Live API end to end. It handles bidirectional audio streaming, video frames, function calling, voice activity detection for interruptions, and the newer native audio features. The guide is opinionated about using gemini-3.1-flash-live-preview and the new google-genai SDKs, with clear warnings about deprecated models. It's honestly refreshing to see concrete audio format specs (16-bit PCM at 16 kHz for input, 24 kHz for output) and partner integrations like LiveKit if you'd rather use WebRTC. Good for real-time conversational AI where you need low latency and don't want to guess at the config options.
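To make the audio format concrete, here is a minimal sketch of preparing microphone-style samples as 16-bit PCM at 16 kHz using only the Python standard library. It does not call the Live API or the google-genai SDK. The helper names (`float_to_pcm16`, `chunk_pcm`), the mono little-endian layout, and the 100 ms chunk size are assumptions for illustration, not something the skill specifies.

```python
import math
import struct

INPUT_RATE_HZ = 16_000   # input sample rate named in the description
OUTPUT_RATE_HZ = 24_000  # output sample rate named in the description
BYTES_PER_SAMPLE = 2     # 16-bit PCM means two bytes per sample


def float_to_pcm16(samples):
    """Convert floats in [-1.0, 1.0] to 16-bit PCM bytes.

    Assumes mono audio and little-endian byte order.
    """
    clipped = [max(-1.0, min(1.0, s)) for s in samples]
    ints = [int(round(s * 32767)) for s in clipped]
    return struct.pack(f"<{len(ints)}h", *ints)


def chunk_pcm(pcm, chunk_ms=100, rate_hz=INPUT_RATE_HZ):
    """Split PCM bytes into fixed-duration chunks for streaming.

    The 100 ms default is an arbitrary choice for this sketch.
    """
    chunk_bytes = rate_hz * chunk_ms // 1000 * BYTES_PER_SAMPLE
    return [pcm[i:i + chunk_bytes] for i in range(0, len(pcm), chunk_bytes)]


# One second of a 440 Hz test tone at the input rate.
tone = [0.5 * math.sin(2 * math.pi * 440 * n / INPUT_RATE_HZ)
        for n in range(INPUT_RATE_HZ)]
pcm = float_to_pcm16(tone)
chunks = chunk_pcm(pcm)
print(len(pcm), len(chunks), len(chunks[0]))  # prints "32000 10 3200"
```

One second of input audio comes to 32,000 bytes at this format, so each 100 ms chunk is 3,200 bytes. Audio coming back at 24 kHz would be 48,000 bytes per second under the same 16-bit assumption.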
npx -y skills add google-gemini/gemini-skills --skill gemini-live-api-dev --agent claude-code
Installs into .claude/skills of the current project.
prisma/skills
firebase/agent-skills
Dexploarer/hyper-forge
itsmostafa/aws-agent-skills