Features of Structured Data – DP-900 Exam Guide
Why Is Understanding Structured Data Important?
Structured data is one of the foundational concepts in the DP-900: Microsoft Azure Data Fundamentals exam. It represents the most traditional and widely used form of data storage and is critical to understanding relational databases, data warehousing, and many Azure data services. A solid grasp of structured data features helps you distinguish it from semi-structured and unstructured data — a key skill tested on the exam.
What Is Structured Data?
Structured data is data that adheres to a fixed schema, meaning every record (row) follows the same set of fields (columns) with predefined data types. It is typically stored in relational databases using tables composed of rows and columns.
Examples of structured data include:
- Customer records in a SQL database (name, email, phone number, address)
- Financial transaction logs (date, amount, account number, transaction type)
- Employee payroll tables
- Inventory management systems
Key Features of Structured Data
1. Fixed Schema (Predefined Structure)
Structured data requires a schema to be defined before data is written. The schema specifies table names, column names, data types (e.g., INT, VARCHAR, DATE), and constraints. Every piece of data must conform to this schema.
2. Tabular Format (Rows and Columns)
Data is organized into tables. Each table has columns (also called fields or attributes) and rows (also called records or tuples). This tabular representation makes it easy to query and analyze.
3. Strongly Typed
Each column has a specific data type. For example, an Age column might be defined as an integer, while a Name column is defined as a string. This enforces data integrity and consistency.
4. Relationships Between Tables
Structured data supports relationships through primary keys and foreign keys. This allows data to be normalized and spread across multiple related tables, reducing redundancy.
5. Queried Using SQL
Structured data is typically accessed and manipulated using Structured Query Language (SQL). SQL provides powerful capabilities for filtering, sorting, joining, and aggregating data.
6. Data Integrity and Constraints
Structured data supports constraints such as NOT NULL, UNIQUE, PRIMARY KEY, FOREIGN KEY, and CHECK constraints. These ensure accuracy and reliability of the data stored.
7. Easy to Search and Analyze
Because of its consistent format, structured data is highly efficient for searching, filtering, reporting, and performing analytics.
8. ACID Compliance
Relational databases that store structured data often support ACID transactions (Atomicity, Consistency, Isolation, Durability), ensuring reliable data processing.
How Structured Data Works in Azure
Microsoft Azure provides several services for working with structured data:
- Azure SQL Database – A fully managed relational database service for structured data.
- Azure SQL Managed Instance – A managed SQL Server instance in the cloud.
- Azure Database for MySQL / PostgreSQL – Managed open-source relational database services.
- Azure Synapse Analytics – Used for large-scale structured data analytics and data warehousing.
In each of these services, data is stored in tables with a predefined schema and queried using SQL or SQL-like languages.
Structured vs. Semi-Structured vs. Unstructured Data
Structured Data: Fixed schema, tables with rows and columns, SQL-queryable (e.g., relational databases).
Semi-Structured Data: Flexible schema, uses tags or keys to organize data (e.g., JSON, XML, Key-Value pairs).
Unstructured Data: No schema, no predefined format (e.g., images, videos, audio files, free-form text).
Understanding these distinctions is essential for the DP-900 exam.
Exam Tips: Answering Questions on Features of Structured Data
1. Remember the keyword: Schema. If a question mentions a fixed or predefined schema, it is almost certainly referring to structured data. Structured data always requires the schema to be defined before data is inserted.
2. Tables = Structured. Whenever you see references to rows, columns, and tables, think structured data and relational databases.
3. SQL is the language of structured data. If a question asks what language is used to query structured data, the answer is SQL.
4. Know the difference between data types. The exam frequently tests your ability to classify data as structured, semi-structured, or unstructured. A CSV file or a relational database table is structured. A JSON document is semi-structured. A video file is unstructured.
5. Watch for constraint-related questions. Questions about enforcing data integrity, primary keys, foreign keys, or NOT NULL constraints all point to features of structured data in relational databases.
6. Azure service mapping matters. Know which Azure services handle structured data: Azure SQL Database, Azure SQL Managed Instance, Azure Database for MySQL/PostgreSQL, and Azure Synapse Analytics (for analytics on structured data).
7. Normalization is a structured data concept. If a question references normalization, reducing data redundancy, or splitting data across related tables, it relates to structured data.
8. Look for ACID properties. If a question discusses transactional consistency, atomicity, or durability, it typically refers to structured data stored in relational databases.
9. Elimination strategy: If an exam question asks about data that has no predefined format or uses self-describing tags, eliminate structured data as the answer. Use process of elimination to distinguish between the three data types.
10. Practice scenario-based questions. The DP-900 often presents real-world scenarios (e.g., storing customer orders, managing inventory) and asks you to identify the type of data or recommend an appropriate storage solution. Structured data scenarios typically involve well-defined entities with consistent attributes.