- R 3.1.0 (codename “Spring Dance”) is released this week!
- Do you invest in the stock market? If so, you may know the so-called 60/40 rule (invest 40% in bonds and 60% in stocks). But do you really believe in this strategy? Eran Raviv runs some simulation studies to check whether the 60/40 rule is a wise choice or simply a myth.
- Popular R articles of the week: “Pretty” table columns and Calculating confidence intervals for proportions by Alan Haynes (Insights of a PhD student), Interpreting interaction coefficient in R by Lionel H. (biologyforfun) and Extract CSV data from PDF files with Tabula by Nathan Yau (Flowingdata).
- And finally, the most loyal fans in the NBA are…
- To give this year’s April Fools’ day a more analytical touch, here are The 7 Funniest Data Cartoons.
- Xi’an discusses a new paper by Scott Schmidler and his Ph.D. student Douglas VanDerwerken called Parallel MCMC.
- Tim Harford of the Financial Times shares his thoughts on Big Data in an article called Big data: are we making a big mistake?
- David Springate publishes three very useful R articles this week: Develop in RStudio, run in RScript, Functional programming in R, and Two R tutorials for beginners.
- And finally, Revolution Analytics summarizes some recent news and reports on how the rise of the “R” computer language brings open source to science.
- Are you a fan of Wes Anderson? Revolution Analytics shares some ideas on how you can bring his style to your own R charts, by making use of these Wes Anderson inspired palettes.
- Given 3 random variables X, Y and Z with known distributions, can you calculate cov(X, Y) from cov(X, Z) and cov(Y, Z)?
- Some useful R tips this week are: Filtering Data with L1 Regularisation, quickly calculating summary statistics from a data frame, A Simple Introduction to the Graphing Philosophy of ggplot2, and Visualizing principal components with R and Sochi Olympic Athletes.
- Xi’an reviews Bayesian Data Analysis by Andrew Gelman, John Carlin, Hal Stern, David Dunson, Aki Vehtari, and Don Rubin.
- And finally, Nathan Yau of FlowingData presents some visuals from a study on smoking prevalence from 1996 to 2012, and concludes that the smoking rate is inversely proportional to income level.
- James Paul Peruvankal of Revolution Analytics shares the secrets of teaching R. Joseph Rickert of the same organization lists some online sources for downloading data sets in his article called Data Sets for Data Science.
- Some interesting R-related articles this week are: Species occurrence data by Karthik Ram of rOpenSci, barplot with ggplot2 by Martin Johnsson (PhD student at Linköping University), Stop using bivariate correlations for variable selection and The German Tank Problem: The Frequentist Way by Jacob Simmering (PhD student at University of Iowa), MCMC for Econometrics Students by Professor David Giles of University of Victoria (part I, part II and part III), Normality and Testing for Normality by Thomas Hopper (aka Learning as You Go), and It is time for RData files to become the standard for Data Transfer by Francis Smart (PhD student at Michigan State University).
- Xi’an discusses his new paper (with Matthew Moores and Kerrie Mengersen) called Pre-processing for approximate Bayesian computation in image analysis.
- And finally, the Royal Statistical Society publishes the Timeline of Statistics – a timeline with illustrations and texts that covers major events in the world of statistics starting from 450 BC.
- R 3.0.3 is released (with installation and upgrading instructions and a list of updates, bug fixes and changes).
- Suppose a company has 5 servers, and there is a 1% chance that each server will be down. What is the probability that at least 3 servers are down?
- Mikio L. Braun, a PostDoc in machine learning at TU Berlin and co-founder and chief data scientist at streamdrill, discusses the difficulties of data analysis.
- Xi’an comments on a new paper by his PhD student called Approximate Integrated Likelihood via ABC methods.
- How people really read and share online.
- Joseph Rickert of Revolution Analytics publishes his R “meta” book, a collection of 14 books (all available online for free) that covers useful topics including basic probability and statistics, regressions, experimental design, survival analysis, time series analysis and forecasting, machine learning, bioinformatics, structural equation models and credit scoring.
- And finally, Flavio Barros compiles a list of MOOC courses on R.
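
The server question listed above (5 servers, each with a 1% chance of being down) can be worked out with a binomial tail sum, provided we assume the servers fail independently. That independence assumption is ours and is not stated in the question; the helper name `prob_at_least` is likewise hypothetical. A minimal sketch in Python:

```python
from math import comb


def prob_at_least(k, n, p):
    """P(at least k of n events occur), assuming independent events
    that each occur with probability p (binomial model)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


# 5 servers, each down with probability 0.01; independence is an assumption.
p_down = prob_at_least(3, 5, 0.01)
print(f"{p_down:.4e}")
```

Under that assumption the answer is roughly 9.85 in a million. If outages are correlated (shared power, shared network), the true probability could be much higher, so the figure is best read as a lower-end estimate.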