Exercise 0.1
We will work with the iris dataset, which contains size measurements of different species of flowers. This dataset is built into R and can be loaded with the simple command below.
data('iris')
Let’s take a look at the data.
Read about these functions, try them on iris
, and try to understand their output:
Exercise 0.2
Say we want to draw random samples from iris
. How do we do this? Go to Google and search for “draw random samples from data frame R”. One of the search results will be the following question on StackOverflow:
https://stackoverflow.com/questions/8273313/sample-random-rows-in-dataframe
Note how different the solutions are. Try loading the dplyr
library and using the sample_n()
function to draw random samples from iris
.
Exercise 0.3
In this exercise we will try to write a script.
- Make a directory structure to store scripts for the exercises: r_for_statistics > exercises
- Make a script in the exercises directory: exercise_0_2.R
- Load the “ggplot2” and “magrittr” libraries
- If you didn’t already install these, do it now
- Write the following code:
- ggplot(iris, aes(Sepal.Width)) + geom_histogram(bins=10)
- Try evaluating:
- A single line of code (ctrl + enter)
- A block of code (highlight + ctrl + enter)
- The entire file (ctrl + alt + enter)
- Write comments for the code
LS0tCnRpdGxlOiAiRXhlcmNpc2UgMCIKb3V0cHV0OiBodG1sX25vdGVib29rCi0tLQoKIyBFeGVyY2lzZSAwLjEKCldlIHdpbGwgd29yayB3aXRoIHRoZSBpcmlzIGRhdGFzZXQsIHdoaWNoIGNvbnRhaW5zIHNpemUgbWVhc3VyZW1lbnRzIG9mIGRpZmZlcmVudCBzcGVjaWVzIG9mIGZsb3dlcnMuIFRoaXMgZGF0YXNldCBpcyBidWlsdCBpbnRvIFIgYW5kIGNhbiBiZSBsb2FkZWQgd2l0aCB0aGUgc2ltcGxlIGNvbW1hbmQgYmVsb3cuCgpgYGB7cn0KZGF0YSgnaXJpcycpCmBgYAoKTGV0J3MgdGFrZSBhIGxvb2sgYXQgdGhlIGRhdGEuCgpgYGB7cn0KaGVhZChpcmlzKQpgYGAKClJlYWQgYWJvdXQgdGhlc2UgZnVuY3Rpb25zLCB0cnkgdGhlbSBvbiBgaXJpc2AsIGFuZCB0cnkgdG8gdW5kZXJzdGFuZCB0aGVpciBvdXRwdXQ6CgoqIGBkaW1gCiogYGxlbmd0aGAKKiBgbmFtZXNgCiogYHN1bW1hcnlgCgojIEV4ZXJjaXNlIDAuMgoKU2F5IHdlIHdhbnQgdG8gZHJhdyByYW5kb20gc2FtcGxlcyBmcm9tIGBpcmlzYC4gSG93IGRvIHdlIGRvIHRoaXM/IEdvIHRvIEdvb2dsZSBhbmQgc2VhcmNoIGZvciAiZHJhdyByYW5kb20gc2FtcGxlcyBmcm9tIGRhdGEgZnJhbWUgUiIuIE9uZSBvZiB0aGUgc2VhcmNoIHJlc3VsdHMgd2lsbCBiZSB0aGUgZm9sbG93aW5nIHF1ZXN0aW9uIG9uICoqU3RhY2tPdmVyZmxvdyoqOgoKPiBodHRwczovL3N0YWNrb3ZlcmZsb3cuY29tL3F1ZXN0aW9ucy84MjczMzEzL3NhbXBsZS1yYW5kb20tcm93cy1pbi1kYXRhZnJhbWUKCk5vdGUgaG93IGRpZmZlcmVudCB0aGUgc29sdXRpb25zIGFyZS4gVHJ5IGxvYWRpbmcgdGhlIGBkcGx5cmAgbGlicmFyeSBhbmQgdXNpbmcgdGhlIGBzYW1wbGVfbigpYCBmdW5jdGlvbiB0byBkcmF3IHJhbmRvbSBzYW1wbGVzIGZyb20gYGlyaXNgLgoKCiMgRXhlcmNpc2UgMC4zCgpJbiB0aGlzIGV4ZXJjaXNlIHdlIHdpbGwgdHJ5IHRvIHdyaXRlIGEgc2NyaXB0LgoKKiBNYWtlIGEgZGlyZWN0b3J5IHN0cnVjdHVyZSB0byBzdG9yZSBzY3JpcHRzIGZvciB0aGUgZXhlcmNpc2VzOiByX2Zvcl9zdGF0aXN0aWNzID4gZXhlcmNpc2VzCiogTWFrZSBhIHNjcmlwdCBpbiB0aGUgZXhlcmNpc2VzIGRpcmVjdG9yeTogZXhlcmNpc2VfMF8yLlIKKiBMb2FkIHRoZSDigJxnZ3Bsb3Qy4oCdIGFuZCDigJxtYWdyaXR0cuKAnSBsaWJyYXJpZXMKICAgICogSWYgeW91IGRpZG7igJl0IGFscmVhZHkgaW5zdGFsbCB0aGVzZSwgZG8gaXQgbm93CiogV3JpdGUgdGhlIGZvbGxvd2luZyBjb2RlOgogICAgKiBnZ3Bsb3QoaXJpcywgYWVzKFNlcGFsLldpZHRoKSkgKyBnZW9tX2hpc3RvZ3JhbShiaW5zPTEwKQoqIFRyeSBldmFsdWF0aW5nOgogICAgKiBBIHNpbmdsZSBsaW5lIG9mIGNvZGUgKGN0cmwgKyBlbnRlcikKICAgICogQSBibG9jayBvZiBjb2RlIChoaWdobGlnaHQgKyBjdHJsICsgZW50ZXIpCiAgICAqIFRoZSBlbnRpcmUgZmlsZSAoY3RybCArIGFsdCArIGVudGVyKQoqIFdyaXRlIGNvbW1lbnRzIGZvciB0aGUgY29kZQoKCgoKCg==