1. A method used to classify the output into a finite number of classes:
(A) Database (B) Online shopping (C) Classification (D) None of these
Answer: (C) Classification
2. If the number of output classes is two, then the classification is:
(A) Datasets (B) Binary classification (C) Both A & B (D) None of these
Answer: (B) Binary classification
3. They are responsible for handling large amounts of structured and unstructured data:
(A) Scientists (B) Economists (C) Data engineers (D) Data scientists
Answer: (C) Data engineers
4. A type of machine learning based on rewards and punishments:
(A) Reinforcement learning (B) Clustering learning (C) Supervised machine learning (D) Unsupervised machine learning
Answer: (A) Reinforcement learning
5. It is a component of data science:
(A) Billing (B) Gaming (C) Machine learning (D) Social media
Answer: (C) Machine learning
Section B:
B.Write TRUE or FALSE.
1. The available data may have noise or unused information.
Answer: TRUE
2. To understand data in a better manner, different visualisation techniques must be used.
Answer: TRUE
3. The first and most important component of data science is the availability of relevant data.
Answer: TRUE
4. Data transportation can help to make predictions about unforeseen or future data.
Answer: FALSE
5. A data mining engineer is an expert who uses data to provide insights.
Answer: TRUE
Section C: Fill in the Blanks
c. Choose the correct word from the box: algorithms classes organisation knowledge cleaning
1. To handle data, a data scientist needs to use different data storage, data transformation methods, data cleaning and ________.
Answer: algorithms
2. Data mining engineers develop complex ________.
Answer: algorithms
3. Based on their extensive ________, senior data scientists establish and develop new standards for assessing data.
Answer: knowledge
4. Data science implementation is now a requirement for any ________ trying to grow.
Answer: organisation
5. The number of output ________ may be two or more than two.
Answer: classes
Section D: Short Answer Questions
1. What is data science?
Data Science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge and insights from structured and unstructured data.
2. Write the components of data science.
Data collection, Data cleaning, Data transformation, Data analysis, Data visualization, Machine learning/Algorithms, and Interpretation of results.
State the tasks of a data scientist.
Collecting and cleaning data, exploratory data analysis, building predictive models, data visualization, and communicating insights to stakeholders.
3. What are outliers? Give an example.
Outliers are data points that significantly differ from the other observations.
Example: In a dataset of student ages (mostly 18–22), one entry of age 80 is an outlier.
4. List any five applications of data science.
* Healthcare (disease prediction)
* Finance (fraud detection)
* E-commerce (recommendation systems)
* Social media (sentiment analysis)
* Transportation (traffic prediction)
5. Write a short note on various sources of data science.
Sources include databases, APIs, web scraping, IoT sensors, social media platforms, public datasets, transaction logs, surveys, and images/videos.

No comments:
Post a Comment