Topic: "text-rendering"
Nano Banana 2 aka Gemini 3.1 Flash Image Preview: the new SOTA Imagegen model
gemini-3.1-flash gpt-5.2 gpt-5.3-codex opus-4.6 claude google google-deepmind microsoft anthropic perplexity-ai image-generation text-rendering 3d-imaging real-time-information agentic-ai persistent-memory multi-agent-systems tooling coding-agents task-delegation sundarpichai demishassabis mustafasuleyman yusuf_i_mehdi borisdayma aravsrinivas
Google and DeepMind launched Nano Banana 2 (aka Gemini 3.1 Flash Image Preview), a leading image generation and editing model integrated across multiple Google products, with features such as 4K upscaling, multi-subject consistency, and real-time search-conditioned generation. Evaluations rank it #1 on text-to-image tasks, and its pricing is competitive. The issue also notes advances in agentic coding with GPT-5.2, GPT-5.3 Codex, Opus 4.6, and Gemini 3.1, alongside Microsoft's Copilot Tasks, which introduces task delegation. Persistent memory features are rolling out in Claude models, though interoperability challenges remain.
Nano Banana Pro (Gemini Image Pro) solves text-in-images, infographic generation, 2-4k resolution, and Google Search grounding
gemini-3-pro gpt-5 google openai hugging-face togethercompute lmsys image-generation text-rendering model-provenance scientific-research proof-assistance multimodal-integration api-access fine-tuning jeffdean kevinweil demishassabis
Google launched Gemini 3 Pro Image (Nano Banana Pro), a next-generation AI image generation and editing model with integrated Google Search grounding, multi-image composition, and fine-grained visual controls, priced at $0.134 per 2K image and $0.24 per 4K image. Text rendering improves markedly: the error rate drops from 56% for its predecessor to 8%. The model also includes SynthID watermark checks for provenance. It is available via the Gemini App, API, LM Arena, Hugging Face Spaces, Together AI, and Flow. Meanwhile, OpenAI shared early experiments in which GPT-5 accelerated scientific research: "GPT-5 accelerated research tasks in math/physics/biology/materials; in 4, it helped find proofs of previously unsolved problems."
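To put the figures quoted in this entry in concrete terms, here is a minimal sketch of the arithmetic. It assumes the per-image prices are flat (no volume discounts or token-based adjustments, which the summary does not mention), and the helper names `batch_cost` and `relative_error_reduction` are hypothetical, chosen only for illustration.

```python
# Prices quoted in the summary (USD per generated image).
# Assumption: flat per-image pricing, no volume tiers.
PRICE_PER_IMAGE = {"2K": 0.134, "4K": 0.24}


def batch_cost(n_images: int, resolution: str) -> float:
    """Estimated cost in USD for n_images at the given resolution."""
    return round(n_images * PRICE_PER_IMAGE[resolution], 2)


def relative_error_reduction(before: float, after: float) -> float:
    """Fraction of the original error rate that was eliminated."""
    return (before - after) / before


print(batch_cost(1000, "2K"))  # 134.0
print(batch_cost(1000, "4K"))  # 240.0
# Text-rendering error rate: 56% for the predecessor vs 8% now.
print(f"{relative_error_reduction(0.56, 0.08):.0%}")  # 86%
```

Under these assumptions, a 4K image costs roughly 1.8 times as much as a 2K image, and the drop from 56% to 8% corresponds to about an 86% relative reduction in text-rendering errors.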