- R 3.0.3 is release (with installation and upgrading instructions and a list of updates, bug fixes and changes).
- Suppose a company has 5 servers, and there is a 1% chance that each server will be down. What is the probability that at least 3 servers are down?
- Mikio L. Braun, a PostDoc in machine learning at TU Berlin and co-founder and chief data scientist at streamdrill, discusses the difficulties of data analysis.
- Xi’an comments on a new paper by his PhD student called Approximate Integrated Likelihood via ABC methods.
- How people really read and share online.
- Joseph Rickert of Revolution Analytics publishes his R “meta” book, a collection of 14 books (all available online for free) that covers useful topics including basic probability and statistics, regressions, experimental design, survival analysis, times series analysis and forecasting, machine learning, bioinformatics, structural equation models and credit scoring.
- And finally, Flavio Barros compiles a list of MOOC courses on R.

17

Mar 14

## The week in stats (Mar. 17th edition)

10

Mar 14

## The week in stats (Mar. 10th edition)

- A historian, a data scientist, a programmer, a mathematician, and a philosopher discuss the question
*How likely it is that a lottery draw (6 out of 49) contains two consecutive numbers.* - Suppose that A, B, and C are uniformly distributed on [0, 1], what is the probability that the equation has real root(s)?
- Dimiter Toshkov of
*Rules of Reason*presents Predicting movie ratings with IMDb data and R and suggests a different way of awarding the Academy Awards based on statistics. - Visualized related articles are always liked by our readers. This week, we have: Plotting an Odd number of plots in single image, Beautiful table outputs in R, Visualizations on the Monopoly board, and Basketball movements visualized.
- Xi’an reviews
*Bayesian Programming*by Pierre Bessière, Emmanuel Mazer, Juan-Manuel Ahuactzin, and Kamel Mekhnacha. - Ever wonder how popular your favorite R functions are? Check out the Function Counter for R.
- And finally, Rasmus Bååth shares easy ways to create matrices in R.

03

Mar 14

## The week in stats (Mar. 3rd edition)

- Like almost every week, R articles attract lots of attention from readers. This week, we have: Quick and dirty notes on General Linear Mix Models, How to Make a Bad Password with R, rMaps and the Mexico map, How to Read Histograms and Use Them in R, Useful Functions in R for Manipulating Text Data, and Simply creating various scatter plots with ggplot.
- r4stats.com publishes a detailed report on various ways of measuring the popularity or market shares of approximately 30 software packages for analytics, including well-known names such as R, Matlab, SAS, SPSS, Stata, Python.
- Quintuitive discusses his experience and thoughts after using RStudio for one year.
- Xi’an reviews two new books this week, the first one is called Nonlinear Time Series by Randal Douc, Éric Moulines and David Stoffer, and the second is called Foundations of Statistical Algorithms by Claus Weihs, Olav Mersman and Uwe Ligges.
- If you are an active stock investor, you should consider Using CART for Stock Market Forecasting.
- And finally, Nathan Yau of FlowingData explains the statistical reasoning behind why you should buy the bigger pizza.

24

Feb 14

## The week in stats (Feb. 24th edition)

- Last Monday (Feb. 17th) was R.A. Fisher’s birthday. To honor him, Deborah G. Mayo, Professor of Philosophy at Virginia Tech, publishes
*Fisher and Neyman after anger management,*and R.A. Fisher: ‘Two New Properties of Mathematical Likelihood’. - A few articles related to visualization and graphics received lots of attention: ggplot2: Cheatsheet for Visualizing Distributions, Automatically coloring your R output in the terminal using colorout, A visual explanation of conditional probability, R: Fun with surf3D function and No need for SPSS – beautiful output in R.
- Given that you have a five-card hand with ♠K and ♡K, what is the probability that you have all four Kings?
- Coursera offers a new MOOC course called Data Analysis for Genomics. The course starts on April 7, 2014.
- And finally, Arthur Charpentier (aka Freakonometrics) publishes a technical article called Identification of ARMA processes.

17

Feb 14

## The week in stats (Feb. 17th edition)

- Professor Roger Peng of the Johns Hopkins Bloomberg School of Public Health discusses the meaning of Reproducible Analysis, why it is important, and how to ensure that your R analysis is reproducible.
- A recent survey by Revolution Analytics show that R language skills attract median salaries in excess of $110,000 in the United States.
- Last week, many helpful R articles attracted attention from readers. A million ways to connect R and Excel, efficiency of Importing Large CSV Files in R, R framework with Object-Oriented Programming, ggplot Fit Line and Lattice Fit Line in R, and Interactive maps with R.
- Big Data is a popular term that everyone in almost every field discusses, however, Stephen Turner, assistant professor of public health sciences and director of the Bioinformatics Core at the University of Virginia, argues that There is no Such Thing as Biomedical “Big Data”.
- Welcome to the age of Databall – the rise of analytics usage in the NBA.
- And finally, suppose that you pick a random interger from 0 to 1000. Given that this integer is divisible by 4, what is the probability that it is also divisible by 3?