Implement generative AI solutions
Build generative AI solutions using Azure AI Foundry, Azure OpenAI, and prompt engineering.
Implementing generative AI solutions in Azure involves leveraging Microsoft's comprehensive suite of AI services to build applications that can create new content, including text, images, code, and more. The primary service for this is Azure OpenAI Service, which provides access to powerful languag…
AI-102 - Implement generative AI solutions Example Questions
Test your knowledge of Implement generative AI solutions
Question 1
What is the primary function of the 'prompt' parameter when invoking Azure OpenAI's DALL-E API for image synthesis?
Question 2
Which Azure AI capability enables developers to examine the reasoning chain and decision pathways that a model follows when processing input data and generating outputs?
Question 3
A media streaming company is building an Azure OpenAI-powered content moderation system to review user-generated video captions and comments across their platform. The system needs to detect policy violations, identify harmful content, and suggest appropriate actions. During pilot testing with 5,000 daily captions, the team deployed a GPT-3.5-Turbo model in the West Europe region using the standard consumption tier. The moderation team reports that response times vary significantly - ranging from 800ms during early morning hours to 12 seconds during evening peak times (6-10 PM) when user activity spikes. The compliance officer requires consistent moderation speed regardless of time of day, as regulatory guidelines mandate that flagged content must be reviewed within defined SLAs. The platform currently processes 40,000 captions daily, with 65% occurring during the evening window. Each caption analysis consumes approximately 800-1,200 tokens including the prompt engineering for context and policy guidelines. The CFO prefers predictable monthly costs for budgeting purposes and can accommodate higher baseline expenses if performance variability is eliminated. The engineering team has confirmed that the model type (GPT-3.5-Turbo) meets accuracy requirements. Which solution should the AI architect recommend to address the performance inconsistency?