Azure provides comprehensive data and compute services designed to support machine learning workflows efficiently. These services form the foundation for building, training, and deploying ML models at scale.
**Azure Machine Learning** is the primary platform that integrates various compute and data services. It offers managed compute resources including compute instances for development, compute clusters for training, and inference clusters for deployment.
**Compute Services:**
1. **Compute Instances** - Virtual machines configured for ML development, running Jupyter notebooks and other tools.
2. **Compute Clusters** - Multi-node clusters of virtual machines that scale automatically with workload demand, ideal for training models on large datasets.
3. **Kubernetes Clusters** - Azure Kubernetes Service integration enables containerized model deployment for production scenarios.
4. **Attached Compute** - Connect existing Azure resources like Databricks clusters or virtual machines to your workspace.
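To make these compute options concrete, the following is a minimal sketch using the Azure Machine Learning Python SDK v2 (the `azure-ai-ml` package). The subscription, resource group, workspace, resource names, and VM sizes are placeholder assumptions rather than values from a specific environment.

```python
# A minimal sketch using the Azure ML Python SDK v2 (azure-ai-ml).
# Subscription, resource group, workspace, resource names, and VM sizes
# below are placeholder assumptions.
from azure.identity import DefaultAzureCredential
from azure.ai.ml import MLClient
from azure.ai.ml.entities import ComputeInstance, AmlCompute

ml_client = MLClient(
    credential=DefaultAzureCredential(),
    subscription_id="<subscription-id>",
    resource_group_name="<resource-group>",
    workspace_name="<workspace-name>",
)

# Compute instance: a single VM used as a development workstation.
dev_box = ComputeInstance(name="dev-instance", size="STANDARD_DS3_V2")
ml_client.compute.begin_create_or_update(dev_box)

# Compute cluster: scales between 0 and 4 nodes with the training workload.
cpu_cluster = AmlCompute(
    name="cpu-cluster",
    size="STANDARD_DS3_V2",
    min_instances=0,   # no nodes (and no compute cost) while idle
    max_instances=4,
)
ml_client.compute.begin_create_or_update(cpu_cluster)
```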
**Data Services:**
1. **Datastores** - Secure connections to Azure storage services including Azure Blob Storage, Azure Data Lake, Azure SQL Database, and Azure Files.
2. **Datasets** - Versioned references to data that can be used across experiments, enabling reproducibility and data lineage tracking.
3. **Azure Data Lake Storage** - Scalable storage optimized for big data analytics workloads.
4. **Azure Blob Storage** - Cost-effective object storage for unstructured data like images, videos, and documents.
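The data services can be sketched with the same SDK. In the snippet below, the storage account, container, credential, and asset names are illustrative assumptions; what matters is the split between a datastore (which stores connection information) and a data asset (a versioned reference to the data itself).

```python
# A hedged sketch of registering a Blob datastore and a versioned data asset
# with the Azure ML Python SDK v2. All names and credentials are placeholders.
from azure.identity import DefaultAzureCredential
from azure.ai.ml import MLClient
from azure.ai.ml.entities import AzureBlobDatastore, AccountKeyConfiguration, Data
from azure.ai.ml.constants import AssetTypes

ml_client = MLClient(DefaultAzureCredential(), "<subscription-id>",
                     "<resource-group>", "<workspace-name>")

# Datastore: holds the connection details for an Azure Blob Storage container.
blob_store = AzureBlobDatastore(
    name="training_blob_store",
    account_name="<storage-account>",
    container_name="training-data",
    credentials=AccountKeyConfiguration(account_key="<account-key>"),
)
ml_client.datastores.create_or_update(blob_store)

# Data asset: a versioned reference to a file reachable through that datastore.
training_data = Data(
    name="churn-training-data",
    version="1",
    path="azureml://datastores/training_blob_store/paths/churn/train.csv",
    type=AssetTypes.URI_FILE,
)
ml_client.data.create_or_update(training_data)
```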
**Integration Benefits:**
These services work together seamlessly within Azure Machine Learning Studio, allowing data scientists to access data from various sources, process it using appropriate compute resources, and manage the entire ML lifecycle. The platform supports automated scaling, which means resources are provisioned when needed and released afterward, optimizing costs while maintaining performance for demanding ML tasks.
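As a rough illustration of that lifecycle, the sketch below submits a training job that reads the data asset registered above and runs on the auto-scaling compute cluster. The source folder, training script, curated environment reference, and names are assumptions.

```python
# A sketch of submitting a training job with the Azure ML Python SDK v2.
# The ./src folder, train.py script, environment name, and the asset/cluster
# names are assumptions carried over from the earlier sketches.
from azure.identity import DefaultAzureCredential
from azure.ai.ml import MLClient, command, Input
from azure.ai.ml.constants import AssetTypes

ml_client = MLClient(DefaultAzureCredential(), "<subscription-id>",
                     "<resource-group>", "<workspace-name>")

job = command(
    code="./src",  # local folder assumed to contain train.py
    command="python train.py --data ${{inputs.training_data}}",
    inputs={
        "training_data": Input(type=AssetTypes.URI_FILE,
                               path="azureml:churn-training-data:1"),
    },
    # Curated environment name seen in Azure ML examples; substitute one
    # available in your workspace.
    environment="AzureML-sklearn-1.0-ubuntu20.04-py38-cpu@latest",
    compute="cpu-cluster",  # the cluster scales up for the job, back down after
)
returned_job = ml_client.jobs.create_or_update(job)  # submits the run
```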
**Data and Compute Services for Machine Learning in Azure**
**Why This Is Important**
Understanding data and compute services is fundamental to passing the AI-900 exam because machine learning workflows depend heavily on proper data management and computational resources. Azure provides specialized services that handle the storage, processing, and training needs of ML projects, and knowing these services demonstrates your ability to design practical AI solutions.
**What Are Data and Compute Services for ML?**
Data and compute services are the infrastructure components that support machine learning workloads in Azure. They include:
**Data Services:**
- Azure Storage: Blob storage for large datasets, files, and unstructured data
- Azure Data Lake Storage: Optimized storage for big data analytics
- Azure SQL Database: Structured data storage for relational datasets
- Datastores in Azure Machine Learning: Abstraction layer connecting to various storage services
- Datasets: Versioned references to data used in experiments
**Compute Services:**
- Compute Instances: Development workstations for data scientists
- Compute Clusters: Scalable clusters for training models
- Inference Clusters: Azure Kubernetes Service (AKS) for deploying models at scale
- Attached Compute: Connect existing Azure resources like VMs or Databricks
**How These Services Work Together**
1. Data Ingestion: Raw data is stored in Azure Storage or Data Lake
2. Data Registration: Data is registered as a datastore in the Azure ML workspace
3. Dataset Creation: Specific data subsets are created for training and testing
4. Compute Provisioning: Appropriate compute resources are selected based on workload
5. Model Training: Compute clusters process data to train models
6. Model Deployment: Inference clusters host trained models for predictions
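A hedged sketch of the final step follows. The workflow above describes AKS-based inference clusters; this example takes the SDK v2 managed online endpoint route, which follows the same register-then-deploy pattern, and assumes an already registered MLflow-format model (so no scoring script or environment has to be supplied). All names are illustrative.

```python
# A hedged sketch of model deployment (step 6) with the Azure ML Python SDK v2,
# using a managed online endpoint. Endpoint, deployment, and model names are
# assumptions; an MLflow-format registered model is assumed so that no scoring
# script or environment is required.
from azure.identity import DefaultAzureCredential
from azure.ai.ml import MLClient
from azure.ai.ml.entities import ManagedOnlineEndpoint, ManagedOnlineDeployment

ml_client = MLClient(DefaultAzureCredential(), "<subscription-id>",
                     "<resource-group>", "<workspace-name>")

# Endpoint: the stable scoring URL that client applications call.
endpoint = ManagedOnlineEndpoint(name="churn-endpoint", auth_mode="key")
ml_client.online_endpoints.begin_create_or_update(endpoint).result()

# Deployment: the model, VM size, and instance count behind that endpoint.
deployment = ManagedOnlineDeployment(
    name="blue",
    endpoint_name="churn-endpoint",
    model="azureml:churn-model:1",   # previously registered model (assumed)
    instance_type="Standard_DS3_v2",
    instance_count=1,
)
ml_client.online_deployments.begin_create_or_update(deployment).result()
```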
**Key Concepts to Remember**
- Datastores store connection information, not the actual data
- Compute instances are for development; compute clusters are for training
- AKS clusters are recommended for production inference workloads
- Compute clusters can scale to zero nodes when not in use to save costs
- GPU-enabled compute is essential for deep learning workloads
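The scale-to-zero and GPU points in the list above can be combined in a single cluster definition. In this sketch the GPU VM size (and whether a given subscription has quota for it) is an assumption.

```python
# A small sketch of a GPU-enabled, scale-to-zero compute cluster.
# The VM size and quota availability are assumptions.
from azure.identity import DefaultAzureCredential
from azure.ai.ml import MLClient
from azure.ai.ml.entities import AmlCompute

ml_client = MLClient(DefaultAzureCredential(), "<subscription-id>",
                     "<resource-group>", "<workspace-name>")

gpu_cluster = AmlCompute(
    name="gpu-cluster",
    size="Standard_NC6s_v3",          # GPU VM size for deep learning (assumed available)
    min_instances=0,                  # zero nodes, and zero compute cost, when idle
    max_instances=2,
    idle_time_before_scale_down=300,  # seconds of idleness before nodes are released
)
ml_client.compute.begin_create_or_update(gpu_cluster)
```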
**Exam Tips: Answering Questions on Data and Compute Services**
1. Match the service to the scenario: When a question describes a development environment, think compute instance. For training at scale, think compute cluster. For production deployment, think inference cluster or AKS.
2. Remember cost optimization: Questions about minimizing costs often point to compute clusters with auto-scaling and minimum nodes set to zero.
3. Distinguish datastores from datasets: Datastores are connections to storage; datasets are versioned data references used in experiments.
4. Know when to use GPUs: Deep learning and neural network questions typically require GPU compute.
5. Production vs Development: AKS is for high-scale production; Azure Container Instances (ACI) is suitable for testing and low-scale scenarios.
6. Watch for keywords: Terms like 'scalable training' suggest compute clusters, while 'real-time inference at scale' suggests AKS.
7. Data residency matters: Some questions may focus on keeping data in specific regions for compliance; choose services that support regional deployment.
8. Read carefully: The exam may include similar-sounding options; ensure you understand whether the question asks about storage, compute for training, or compute for deployment.