Exercise 0.1
We will work with the iris dataset, which contains sepal and petal measurements (in centimeters) for 150 flowers from three iris species. This dataset is built into R and can be loaded with the command below.
data('iris')
Let’s take a look at the data.
head(iris)
Read about these functions, try them on iris, and try to understand their output:
- dim
- length
- names
- summary
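As a starting point, here is a minimal sketch of trying dim(), length(), names(), and summary() on iris. The variable names (iris_dim and so on) are only illustrative and are not part of the exercise.

```r
data("iris")  # load the built-in data frame into the workspace

iris_dim     <- dim(iris)      # number of rows and columns
iris_length  <- length(iris)   # for a data frame: the number of columns
iris_names   <- names(iris)    # column names
iris_summary <- summary(iris)  # per-column summary statistics

print(iris_dim)
print(iris_length)
print(iris_names)
print(iris_summary)
```

Note that length() counts columns rather than rows, because a data frame is stored as a list of columns. Use nrow(iris) for the number of rows.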
Exercise 0.2
Say we want to draw random samples from iris. How do we do this? Go to Google and search for “draw random samples from data frame R”. One of the search results will be the following question on StackOverflow:
https://stackoverflow.com/questions/8273313/sample-random-rows-in-dataframe
Note how different the solutions are. Try loading the dplyr library and using the sample_n() function to draw random samples from iris.
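Two of the approaches can be compared side by side, as sketched below. The sample size of 5 and the seed value are arbitrary choices for illustration, and the sketch assumes dplyr is already installed. In dplyr 1.0.0 and later, sample_n() is superseded by slice_sample(), which is shown as a third option.

```r
library(dplyr)

set.seed(1)  # arbitrary seed so that the draws are reproducible

# Base R: draw 5 random row indices, then subset the data frame with them.
base_sample <- iris[sample(nrow(iris), 5), ]

# dplyr: sample_n() draws 5 random rows directly (without replacement by default).
dplyr_sample <- sample_n(iris, 5)

# dplyr >= 1.0.0: slice_sample() is the newer equivalent.
slice_rows <- slice_sample(iris, n = 5)

print(base_sample)
print(dplyr_sample)
print(slice_rows)
```

Each call returns a data frame with 5 rows and the same 5 columns as iris. The rows differ between calls because each one makes a fresh random draw.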
Exercise 0.3
In this exercise we will try to write a script.
- Make a directory structure to store scripts for the exercises: r_for_statistics > exercises
- Make a script in the exercises directory: exercise_0_3.R
- Load the “ggplot2” and “magrittr” libraries
    - If you didn’t already install these, do it now
- Write the following code:
    - ggplot(iris, aes(Sepal.Width)) + geom_histogram(bins=10)
- Try evaluating:
    - A single line of code (ctrl + enter)
    - A block of code (highlight + ctrl + enter)
    - The entire file (ctrl + alt + enter)
- Write comments for the code
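The finished script might look roughly like the sketch below. The install check with requireNamespace() and the variable name p are assumptions added for illustration; the exercise itself only asks for the library calls and the plotting line.

```r
# Install the packages only if they are missing (needs internet access).
for (pkg in c("ggplot2", "magrittr")) {
  if (!requireNamespace(pkg, quietly = TRUE)) install.packages(pkg)
}

# Load the packages used in this script.
library(ggplot2)
library(magrittr)

# Histogram of sepal width for all flowers in iris, split into 10 bins.
p <- ggplot(iris, aes(Sepal.Width)) + geom_histogram(bins = 10)

# print() makes the plot appear even when the whole file is sourced.
print(p)
```

The directory structure from the first step can also be created from the R console with dir.create("r_for_statistics/exercises", recursive = TRUE), run from the folder that should contain it.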