Google announces voice features, image editor, and personal AI agent for Workspace
Gmail, Docs, and Keep gain conversational voice capabilities. Google Pics offers AI-powered image generation and editing. Gemini Spark introduces a 24/7 agent for Workspace users.
1 source · single source
- Google introduced voice-activated search in Gmail Live, allowing users to query inboxes conversationally for specific information.
- Docs Live enables voice-to-text composition with structural organization and integration with Gmail, Drive, Chat and web content.
- Keep now processes spoken notes and automatically organizes them into structured lists without manual formatting.
- Google Pics, built on the Nano Banana model, provides AI image generation and editing with object-level control for creative professionals and everyday users.
- Gemini Spark launches as a 24/7 personal agent that can take actions on behalf of users within Workspace apps, initially available to Google AI Pro and Ultra subscribers.
Google announced a suite of AI-powered features across its Workspace productivity platform on May 19, 2026. The rollout targets the 4 billion users of Gmail, Docs, Drive, and related applications, positioning conversational AI and autonomous agents as standard tools for work and content creation.
Voice interaction capabilities are arriving in three core apps. Gmail Live transforms email search through conversational queries, enabling users to ask questions like 'What's my flight's gate number?' rather than manually searching. Docs Live functions as a co-writing assistant that accepts spoken input, structures arguments, and incorporates information from linked Gmail, Drive, Chat, and web sources into documents. Keep extends this pattern to note-taking, automatically converting rambling speech input into organized lists and structured notes.
Google introduced Google Pics as a dedicated image generation and editing tool. Built on Google's Nano Banana model, the tool emphasizes granular creative control—users can select, move, resize, and modify specific objects within images without regenerating the entire composition. The platform targets both professional design work and casual content creation.
Gemini Spark represents an escalation toward agentic workflows. Positioned as a '24/7 personal AI agent,' Spark operates within the Gemini app and can execute actions across Workspace applications on user direction. The exact scope of autonomous action remains unspecified in the announcement.
Rollout phases vary by user tier. Voice features in Gmail, Docs, and Keep launch in summer 2026 to Google AI Pro and Ultra subscribers, with preview access for Workspace business customers. Google Pics availability details were incomplete in the announcement. Gemini Spark's launch timeline and tier eligibility were not specified.
- May 21, 2026 · TechCrunch
Spotify launches ElevenLabs-powered audiobook creation tool for independent authors
Trust54 - May 20, 2026 · Hugging Face
Hugging Face releases six Ettin reranker models with distillation training recipe
Trust74 - May 18, 2026 · Hugging Face
Hugging Face releases fine-tuning guide for NVIDIA Cosmos video model using LoRA and DoRA
Trust74