Study

APCSP Exam Review 5 Data

  •   0%
  •  0     0     0

  • What does it mean to 'clean' a dataset?
    Remove errors, duplicates, or incomplete entries.
  • What is the best way to reduce survey bias?
    Collect data from a more diverse group
  • What does a histogram show best?
    How often data falls within ranges
  • Name one field where big data is used today.
    healthcare, marketing, climate science, sports
  • True or False: A data table is a type of data visualization.
    False. It presents raw data, not visual patterns.
  • If a pie chart has 4 equal sections, what can you infer?
    Each category represents 25% of the data.
  • What is 'big data'?
    Very large and complex datasets that require special tools to analyze.
  • What is metadata?
    Data that describes other data
  • Why are data visualizations important in analysis?
    They help us spot trends and patterns more easily.
  • What is citizen science?
    Public participation in data collection
  • What is open data?
    Publicly available data anyone can use
  • What’s one risk of using data to make decisions?
    bias, misinterpretation, privacy issues
  • Does machine learning automatically remove human bias?
    No
  • What is a correlation?
    A relationship or connection between two variables.
  • What is a benefit of using open data?
    It allows more people to analyze and learn from the data.
  • Why is a scatter plot better than a bar chart for two numeric columns?
    It shows relationships between values
  • Why can’t a photographer filter photos by “pictures of buildings” using metadata?
    Metadata doesn’t describe image content
  • What is the main goal of data analysis?
    To turn raw data into useful information.
  • What causes bias in machine learning systems?
    Training data
  • What can a line graph show better than a bar chart?
    Trends over time
  • Give one example of qualitative data.
    color, opinion, category
  • You want to study students' music preferences. What data might you collect?
    favorite genre, hours listening per week, artists
  • True or False: The same dataset can be interpreted in many different ways.
    True
  • If all photos have the same time metadata by mistake, what becomes difficult to do? Hiint: Fllter by____
    Filter photos taken at a certain time of day
  • A dataset is just a random collection of facts.
    False. A dataset is organized and structured data.
  • True or False: Machine learning uses data to "train" algorithms to make decisions.
    True
  • What does cleaning data usually involve?
    Fixing inconsistencies and standardizing formats
  • What type of graph is best for comparing two numeric variables?
    Scatter plot
  • What type of data uses numbers to represent information?
    Quantitative data