Using an Iris dataset that contains measurement of iris flowers and their species classification. Write a Python for Spark that will use Decision Tree Classification for predicting the flower species based on the four measurement of iris flowers.
Added by Peter W.
Step 1
Step 1: Load the Iris dataset into a Spark DataFrame. Show more…
Show all steps
Your feedback will help us improve your experience
Willis James and 97 other AP CS educators are ready to help you.
Ask a new question
Labs
Want to see this concept in action?
Explore this concept interactively to see how it behaves as you change inputs.
Key Concepts
Recommended Videos
Load the iris dataset. This dataset contains measurements for iris flowers of three different species. Calculate the mean sepal length and sepal width for each species. Create a boxplot to compare the distributions of petal lengths across the different species.
Willis J.
Sheryl E.
The famous iris dataset (the first sheet of the spreadsheet linked above) was first published in 1936 by Ronald Fisher. The dataset contains 50 samples from 3 iris species: setosa, virginia, and versicolor. Four features are measured, all in cm: sepal length, sepal width, petal length, and petal width.What is the equation for the least square regression line where the independent or predictor variable is sepal length and the dependent or response variable is sepal width for iris versicolor? What is the predicted septal width for iris versicolor for a flower with a septal length of 7.65
Supreeta N.
Recommended Textbooks
Computer Science and Information Technology
Introduction to Programming Using Python
Computer Science - An Overview
Transcript
18,000,000+
Students on Numerade
Trusted by students at 8,000+ universities
Watch the video solution with this free unlock.
EMAIL
PASSWORD