OpenAI has unveiled new AI models for transcription and voice generation, featuring improved accuracy and controllability. The text-to-speech model, gpt-4o-mini-tts, allows developers to customize voice characteristics using natural language commands, while new transcription models gpt-4o-transcribe and gpt-4o-mini-transcribe replace Whisper with enhanced accuracy in noisy environments. Product Head Olivier Godement emphasizes these updates align with OpenAI's vision of building autonomous agent systems. However, unlike previous versions, these models won't be openly available due to their complexity and computing requirements.