Analytics Refresher

1. Which of these are encoding techniques?
Weight of evidence
One hot
TF-IDF
Log transform
Standard scaler
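Refresher note: as a reminder of what encoding a categorical feature can look like, here is a minimal sketch of one-hot encoding in plain Python. The function name `one_hot_encode` and the sample `colors` list are illustrative assumptions, not part of the quiz; in practice a library encoder would normally be used.

```python
def one_hot_encode(values):
    """Return (categories, rows); each row is a 0/1 indicator list."""
    # Fix a stable column order by sorting the distinct categories.
    categories = sorted(set(values))
    index = {cat: i for i, cat in enumerate(categories)}
    rows = []
    for v in values:
        row = [0] * len(categories)
        row[index[v]] = 1  # exactly one column is "hot" per value
        rows.append(row)
    return categories, rows


# Hypothetical sample data for illustration only.
colors = ["red", "green", "blue", "green"]
categories, encoded = one_hot_encode(colors)
for value, row in zip(colors, encoded):
    print(value, row)
```

Each value maps to a row with a single 1 in the column of its category, so no artificial ordering between categories is introduced.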
2. Which of the following techniques is used to reduce the dimensionality of data?
Principal Component Analysis (PCA)
K-Nearest Neighbors (KNN)
Decision Trees
Support Vector Machines (SVM)
3. Which of the following algorithms is commonly used for unsupervised learning?
Linear Regression
K-Means
DBSCAN
Decision Trees
4. What are common steps involved in the data preprocessing phase of a data science project?
Data Cleaning
Feature Engineering
Exploratory Data Analysis (EDA)
Model Training
Handling Missing Values
Scaling Features
5. Which of the following metrics can be used to evaluate a classification model?
R2 Score
Adjusted R2 Score
MAD
F1-score
Pass precision
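Refresher note: the F1-score is the harmonic mean of precision and recall. The sketch below computes it from binary labels in plain Python; the function name `f1_score`, the `positive` parameter, and the sample labels are assumptions made for illustration.

```python
def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are 0 or undefined,
        # so this sketch reports 0.0 by convention.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels for illustration only.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]
score = f1_score(y_true, y_pred)
print(score)
```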
6. In data preprocessing, which methods are used for handling missing data?
Imputation
Removal of missing data points
Using mean/mode values
Standard Scaling
None of the above
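Refresher note: two simple ways of dealing with missing values are dropping the affected points and filling them with a summary statistic. The sketch below shows both in plain Python, assuming missing entries are represented as `None`; the helper names and the `ages` list are hypothetical.

```python
from statistics import mean


def drop_missing(values):
    """Remove entries that are missing (represented here as None)."""
    return [v for v in values if v is not None]


def impute_mean(values):
    """Replace missing entries with the mean of the observed ones.

    Raises statistics.StatisticsError if every entry is missing.
    """
    fill = mean(drop_missing(values))
    return [fill if v is None else v for v in values]


# Hypothetical sample data for illustration only.
ages = [25, None, 35, 40, None]
cleaned = drop_missing(ages)
imputed = impute_mean(ages)
print(cleaned)
print(imputed)
```

Dropping shrinks the dataset, while imputation keeps its size at the cost of reducing the variance of the filled feature.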
7. What is the purpose of ROC (Receiver Operating Characteristic) curves in classification tasks?
To visualize the trade-off between true positive rate and false positive rate
To visualize the distribution of classes in the dataset
To assess the model's accuracy
To determine feature importance
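Refresher note: an ROC curve is built by sweeping a decision threshold over the model's scores and recording the false positive rate and true positive rate at each step. The following is a minimal sketch under the assumption of binary 0/1 labels with both classes present; `roc_points` and the sample data are illustrative names, not a library API.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) pairs from the strictest to the loosest threshold."""
    positives = sum(y_true)
    negatives = len(y_true) - positives
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for t, s in zip(y_true, scores) if s >= threshold and t == 1)
        fp = sum(1 for t, s in zip(y_true, scores) if s >= threshold and t == 0)
        points.append((fp / negatives, tp / positives))
    return points


# Hypothetical labels and scores for illustration only.
y_true = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
points = roc_points(y_true, scores)
for fpr, tpr in points:
    print(f"FPR={fpr:.2f}  TPR={tpr:.2f}")
```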
8. When interpreting a confusion matrix, which quadrant represents instances that are falsely predicted as positive when they are actually negative?
True Positive (TP)
True Negative (TN)
False Positive (FP)
False Negative (FN)
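Refresher note: the four cells of a binary confusion matrix can be tallied directly from the true and predicted labels. This is a minimal sketch in plain Python; the function name `confusion_counts` and the sample labels are assumptions for illustration.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, positive=1):
    """Tally TP, FP, FN and TN for one positive class."""
    counts = Counter()
    for t, p in zip(y_true, y_pred):
        if p == positive:
            # Predicted positive: correct if the true label is positive too.
            counts["TP" if t == positive else "FP"] += 1
        else:
            # Predicted negative: a miss if the true label is positive.
            counts["FN" if t == positive else "TN"] += 1
    return {key: counts[key] for key in ("TP", "FP", "FN", "TN")}


# Hypothetical labels for illustration only.
y_true = [1, 0, 1, 1, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0]
cells = confusion_counts(y_true, y_pred)
print(cells)
```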
9. Which statistical method is commonly used to detect outliers in a dataset?
Standard Deviation Method
Interquartile Range (IQR) Method
Mean Absolute Deviation (MAD) Method
Z-Score Method
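Refresher note: as one concrete illustration of outlier detection, the sketch below flags values lying more than 1.5 times the interquartile range beyond the quartiles. It relies on `statistics.quantiles` with its default method; the multiplier `k=1.5` is the conventional choice, and the function name and sample data are assumptions.

```python
from statistics import quantiles


def iqr_outliers(values, k=1.5):
    """Return values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4)  # three cut points: Q1, median, Q3
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical sample data for illustration only.
data = [10, 12, 11, 13, 12, 11, 14, 95]
outliers = iqr_outliers(data)
print(outliers)
```

Because quartiles are barely affected by extreme values, this rule tends to be more robust than rules based on the mean and standard deviation.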
10. A marketing manager wants to understand the distribution of customer ages in their target market. Which type of plot would be most appropriate for visualizing this information?
Scatter plot
Box plot
Histogram
Bar plot
11. A data analyst wants to explore the relationship between advertising spending and sales revenue for a company over time. Which type of plot would be most effective for visualizing this relationship?
Scatter plot
Line plot
Histogram
Bubble plot
12. A healthcare provider wants to identify patients at high risk of developing chronic diseases based on their medical history and lifestyle factors. Which data analysis technique would be most appropriate for this project?
Regression analysis
Cluster analysis
Association rule mining
Text mining
13. A customer service department of an e-commerce company receives a large volume of customer inquiries through emails and wants to automate the process of classifying and routing these inquiries to the appropriate departments for faster resolution. Which technique would be most appropriate for this task?
Named Entity Recognition (NER)
Sentiment Analysis
Topic Modeling
Word Embeddings
14. A legal firm is handling a large number of legal documents and wants to automatically identify and classify entities such as names of persons, organizations, dates, and monetary values mentioned in the documents for further analysis. Which technique would be most appropriate for this task?
Sentiment Analysis
Named Entity Recognition (NER)
Text Classification
Topic Modeling
15. A wildlife reserve wants to keep track of the total count of each species present. It has installed camera traps in specific regions. Which of the following techniques can help it achieve this?
Topic modeling & Image classification
Object detection & Image classification
Regression & Image segmentation
OCR & Sentiment analysis