Gemini 2.0 native image generation co-doodling
Generate text based on user input with assistance
Instruction-tuned model for a range of vision-language tasks