Archives AI News

Adobe Photoshop integrates Google’s image AI “Nano Banana”

Adobe is bringing Google’s new image AI, known as “Nano Banana” (officially Gemini 2.5 Flash Image), to Photoshop as an optional tool. Source: THE DECODER.

David Zaslav thinks HBO Max is ‘way underpriced’

Everyone’s favorite CEO, Warner Bros. Discovery head David Zaslav, thinks HBO Max is ripe for a price hike. Speaking at the Goldman Sachs Communacopia and Technology Conference (doesn’t that sound like a fun time?), Zaslav argued that his company’s premium output can command a premium price. “The fact that this is quality — and that’s […]

OpenAI Adds Full MCP Tool Support in ChatGPT Developer Mode: Enabling Write Actions, Workflow Automation, and Enterprise Integrations

OpenAI has introduced a major upgrade to ChatGPT’s developer mode by adding full support for Model Context Protocol (MCP) tools. Until now, MCP integrations inside ChatGPT were limited to search and fetch operations, which made them essentially read-only. With this update, MCP connectors can perform write actions, which means developers can now directly update systems, trigger workflows, and […] Source: MarkTechPost.

OpenAI’s gpt-realtime Enables Production-Ready Voice Agents with End-to-End Speech Processing

OpenAI launched gpt-realtime and the Realtime API, enabling production-ready AI voice agents with end-to-end speech processing, lower latency, and natural speech delivery. New features include SIP phone support, image input, MCP server integration, and improved safeguards. Early adopters like Zillow and T-Mobile are testing real-time customer service and search use cases. By Hien Luu

What are Optical Character Recognition (OCR) Models? Top Open-Source OCR Models

Optical Character Recognition (OCR) is the process of turning images that contain text—such as scanned pages, receipts, or photographs—into machine-readable text. What began as brittle rule-based systems has evolved into a rich ecosystem of neural architectures and vision-language models capable…

Meet mmBERT: An Encoder-only Language Model Pretrained on 3T Tokens of Multilingual Text in over 1800 Languages and 2–4× Faster than Previous Models

Why was a new multilingual encoder needed? XLM-RoBERTa (XLM-R) has dominated multilingual NLP for more than 5 years, an unusually long reign in AI research. While encoder-only models like BERT and RoBERTa were central to early progress, most research energy shifted toward decoder-based generative models. Encoders, however, remain more efficient and often outperform decoders on […] Source: MarkTechPost.