Company that manufactures riding mowers wants to identify the best sales prospects for an… 1 answer below »


A bit environing the postulates:A guild that manufactures riding mowers deficiencys to warrant the best sales prospects for an intensive sales hostilities. In point, the creator is assiduous in collocateing households as prospective proprietors or nonowners on the plea of allowance (in $1000s) and lot magnitude (in 1000 ft2). The noteeting expeditions looked at a haphazard case of 24 households, attached in the improve RidingMowers.csv.--------------------------------------------------------------------------------------------------------------------Part 1:First, we decipher in the postulates into R and look at the primeval 6 rows to get a judgment of the constituency of the notice.Question 1: How divers imported/numeric unsteadys are thither and procure the narrowness, climax, median, moderation, and criterion sinuosity for each imported unsteady? [2 pts] Answer: Question 2: How divers ascititious unsteadys are thither and what are the levels of each of those unsteadys? What percentage of households in the consider were proprietors of a riding mower? [2 pts] Answer: Question 3: Engender a postulates visualization of a plantbatch of lot magnitude (x-axis) versus allowance (y-axis), color-coded by the effect unsteady, Occupation (paste your visualization at the end of this worksheet). (a) Describe the undeveloped relationship(s) of occupation to lot magnitude and allowance. (b) From the postulates visualization, which class looks to keep the conspicuous mediocre allowance, proprietors or nonowners? [2 pts for visualization, 1 pt for (a), & 1 pt for (b)] Answer:(a)(b) ------------------------------------------------------------------------------------------------------------------Part 2:Using all the postulates, fit a logistic copy of occupation on the two explanatory unsteadys (Lot Magnitude and Income, no interaction promise).Question 4: Use the output from the logistic copy to accomplish the Likelihood Relative Test, whither we allure parallel the copy delay Lot Magnitude and Allowance to the inoperative (no explanatory unsteady) copy. State the hypotheses, cupel statistic, p-value, and the quittance for this cupel. [1 pt for hypotheses, 1 pt for cupel statistics, 1 pt for p-value, & 1 pt for quittance] Answer: Question 5: Accomplish the Wald cupel for each single coefficient. State the public hypotheses for this point cupel, then finished the board lowerneath and delineate quittance environing each coefficient in the copy. [2 pts for board, 1 pt for hypotheses, & 1 pts per quittance] Estimate Std Error Z appraise Pr( > |t| ) Intercept -25.9382 11.4871 -2.258 0.0239 Income 0.0543 0.0412 Lot Magnitude 0.9638 2.038 Answer:Income:Lot Size: Question 6: For each explanatory unsteady, proportion the odds relative and procure an expound of each. [1 pt for the odds relatives & 1 pt per expoundation] OR(Income)=OR(Lot Size)= Question 7: (a) Using a preface cutoff of 50% to particularize between proprietor and nonowner, what is the percentage of households classified rightly shapeless nonowners? (b) To extension the percentage of rightly classified nonowners, should the cutoff answerance be extensiond or declined? Why? If you concluded a extension or decline would aid, what would be a undeveloped new cutoff? [(a) 1 pt (b) 2 pts] Answer:(a)(b) Question 8: Using a preface of 50% for predicting effects, engender a indistinctness board for the predicted appraises sequence the gentleman classes. Naturalized on this board, proportion the success, the precision/positive premonitory appraise, sensitivity, and the specificity of this logistic copy. [4 pts] Success = Precision/PPV = Sensitivity = Specificity = -----------------------------------------------------------------------------------------------------------------Part 3:Additionally, we would affect to use either rectirectilinear discriminant separation or quadratic discriminant separation as a irrelative way to collocate households. To particularize which species is most after a whilehold, cohibit the arrogances demanded for LDA and QDA.Question 9: Naturalized on your scrutiny of LDA and QDA arrogances, is LDA or QDA over after a whilehold? Elucidate why (e.g. what was the arrogance and what was your quittance)? [2 pts] Answer: Question 10: Using the process you clarified in the antecedent investigation, proportion the copy. (Assume multivariate normality of the unsteadys is gentleman for this postulates.) Engender an after a whilehold postulates visualization of a plantbatch of lot magnitude versus allowance delay the predicted species (paste this at the end of this worksheet). Do we see a public bear to how this species process disjoined the groups? [3 pts] Answer: Question 11: Engender a indistinctness board for the predicted classes sequence the gentleman classes. Naturalized on this board, proportion the success and the precision/positive premonitory appraise of this discriminant copy. [2 pts] Success = Precision/PPV = Question 12: We now keep accomplished 2 irrelative species processs – Logistic copy and a discriminant separation. We deficiency to particularize which species process was reform. (a) Engender a postulates visualization of the ROC flexions of twain processs on the similar descriptive (paste this visualization in the Appendix). (b) Proportion the AUC (area lower the flexion) for each process. (c) Naturalized on our ROC flexions and the AUCs for each process, which process is reform for this species drift and why? [2 pts for (a), 2 pts for (b), & 2 pts for (c)] Answer:(b) AUC for Logistic =AUC for Discriminant process = (c) -------------------------------------------------------------------------------------------------------------------Part 4:Let's suppose that the Occupation unsteady is not public, and we would affect to bunch the observations into proprietor and nonowner. Since thither are merely two categories, we particularized to fit 2 bunchs. As twain unsteadys are in promises of 1000s, this separation minority should be produced on the raw (not scaled) postulates.Question 13: Run the kMeans bunching algorithm on Lot Magnitude and Allowance delay 2 bunchs (run delay 20 haphazard starts). Engender an after a whilehold detailed postulates visualization naturalized on the bunchs root from the kMeans process; since the gentleman Occupation nature is public, mould safe to note the improve and inimprove bunched observations in some mode (paste this visualization in the Appendix at the end of the worksheet). Does 2 bunchs look most after a whilehold? How divers of the observations were inrightly classified? [3 pts] Answer: Question 14: We particularize to cupel various irrelative enumerate of bunchs to particularize the best enumerate for k; do this delay bunchs from 1 to 10 and delay a narrowness of 10 irrelative judicious haphazard starts. Engender an exploratory postulates descriptive delay the k sequences the “total delayin ss” (paste the descriptive into the Appendix). What is the enumerate of bunchs you particularize best, and why? [4 pts] Answer: Question 15: Naturalized on the “k” you particularized best, run the kMeans algorithm delay 10 judicious starts then engender a postulates visualization delay this selected enumerate of bunchs intermittently mould safe to note the gentleman Occupation in some mode (paste the visualization into the Appendix). Describe your results. [3 pts] Answer: ---------------------------------------------------------------------------------------------------------------Part 5:We too particularize to use Hierarchical bunching to bunch the observations into proprietor and nonproprietor groups.Question 16: Does the postulates demand to be scaled? Why or why not? [2 pts] Answer: Question 17: We pick-out to use Euclidean squared interspace delay finished and mediocre linkage processs delayin the Hierarchical bunching. For each process, particularize whither you would “cut” the tree and elucidate why; enumerate the enumerate of bunchs that you are recommending for each process. [4 pts] Answer: Question 18: Naturalized on your recommendations for k, engender a colored dendrogram of the bunched groups as a postulates visualization for each process (mould safe thither is after a whilehold labels and titles) (paste this into the Appendix). [4 pts] Answer: Question 19: Engender a postulates visualization of the plant batch for each of the two hierarchical algorithms (understand after a whilehold labels, titles, etc) (paste this into the Appendix). (a) Does thither answer to be anything interesting/significant findings from these two visualizations? (b) Since we recognize the Ownership, which process (finished or mediocre linkage) is reform? [5 pts] Answer: [5pts for RScript]APPENDIXAll postulates visualizations that we ask for demand to be understandd hither delay labeling for each graph (aka what plod is it for?). Attachments: Unit-3-.docxRidingMowers.csv