Data Analysis
Apply statistical methods and analytical techniques to extract insights from data and communicate findings effectively.
In the context of the CompTIA Data+ certification (Data+ V2), Data Analysis is defined as the systematic process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making. Unlike advanced data sci…
Concepts covered: Tailoring communication to audiences, Technical vs. non-technical reporting, Storytelling with data, Data-driven recommendations, Presentation best practices, Descriptive statistics, Measures of central tendency (mean, median, mode), Measures of dispersion (variance, standard deviation), Correlation and regression basics, Hypothesis testing fundamentals, Statistical significance and p-values, Sampling methods and techniques, Trend analysis and forecasting, A/B testing and experimentation, Debugging data analysis workflows, Identifying data quality issues, Resolving calculation errors, Using documentation and resources, Version control for data analysis, Collaborative troubleshooting, Executive summaries and key findings
Data+ - Data Analysis Example Questions
Test your knowledge of Data Analysis
Question 1
A data analyst is evaluating customer transaction amounts from 400 records. The calculated z-scores reveal that 2.8% of transactions fall beyond ±2.2 standard deviations from the mean, while the empirical rule predicts approximately 2.8% should fall beyond ±2.2 standard deviations for a normal distribution. The skewness is 0.12 and kurtosis is 3.08. Additionally, when the analyst creates a probability plot, the points align closely with the diagonal reference line. However, a Shapiro-Wilk test yields a p-value of 0.04. Given these conflicting indicators with a significance level of 0.05, which conclusion demonstrates the most sophisticated understanding of normality assessment?
Question 2
A customer insights team and a web analytics group have been collaboratively troubleshooting why user conversion funnels show different completion rates for six days. The customer insights team uses event-based tracking from the mobile app SDK, while the web analytics group relies on server-side session logs. Both teams have validated their individual tracking implementations and confirmed data collection is functioning correctly. During joint meetings, each team presents summaries of their findings but tends to defend their own methodology as the accurate baseline. A breakthrough is needed to move past this impasse. Which collaborative troubleshooting strategy would most effectively help both teams identify where their data interpretations fundamentally diverge?
Question 3
A data analyst is debugging a real-time dashboard that displays customer churn predictions. The dashboard pulls data from a stored procedure that joins five tables and applies a machine learning model's probability scores. Users report that churn probabilities occasionally display as negative values, which should be mathematically impossible given the model's sigmoid output function. The stored procedure executes with no errors, the ML model service returns valid responses, and database constraints show no violations. Network latency metrics are within acceptable ranges. Which debugging strategy would most effectively identify the source of these impossible negative probability values?