friendly

Lahman - Sean 'Lahman' Baseball Database

Provides the tables from the 'Sean Lahman Baseball Database' as a set of R data.frames. It uses the data on pitching, hitting and fielding performance and other tables from 1871 through 2025, as recorded in the 2026 version of the database. Documentation examples show how many baseball questions can be investigated.

13.14 score 84 stars 2 dependents 1.8k scripts 115k downloads

vcdExtra - 'vcd' Extensions and Additions

Provides additional data sets, methods and documentation to complement the 'vcd' package for Visualizing Categorical Data and the 'gnm' package for Generalized Nonlinear Models. In particular, 'vcdExtra' extends mosaic, assoc and sieve plots from 'vcd' to handle 'glm()' and 'gnm()' models and adds a 3D version in 'mosaic3d'. Additionally, methods are provided for comparing and visualizing lists of 'glm' and 'loglm' objects. This package is now a support package for the book, "Discrete Data Analysis with R" by Michael Friendly and David Meyer.

categorical-data-visualization, generalized-linear-models, mosaic-plots

12.46 score 28 stars 3 dependents 656 scripts 9.0k downloads

heplots - Visualizing Hypothesis Tests in Multivariate Linear Models

Provides HE plots and other functions for visualizing hypothesis tests in multivariate linear models. HE plots represent sums-of-squares-and-products matrices for linear hypotheses and for error using ellipses (in two dimensions) and ellipsoids (in three dimensions). It also provides other tools for analysis and graphical display of the models, such as robust methods and homogeneity of variance-covariance matrices. The related 'candisc' package provides visualizations in a reduced-rank canonical discriminant space when there are more than a few response variables.

linear-hypotheses, matrices, multivariate-linear-models, plot, repeated-measure-designs, visualizing-hypothesis-tests

12.11 score 10 stars 7 dependents 1.4k scripts 9.0k downloads

matlib - Matrix Functions for Teaching and Learning Linear Algebra and Multivariate Statistics

A collection of matrix functions for teaching and learning matrix linear algebra as used in multivariate statistical methods. Many of these functions are designed for tutorial purposes in learning matrix algebra ideas using R. In some cases, functions are provided for concepts available elsewhere in R, but where the function call or name is not obvious. In other cases, functions are provided to show or demonstrate an algorithm. In addition, a collection of functions is provided for drawing vector diagrams in 2D and 3D and for rendering matrix expressions and equations in LaTeX.

diagrams, linear-equations, matrix, matrix-functions, matrix-visualizer, vector, vignette

11.98 score 71 stars 14 dependents 1.1k scripts 7.9k downloads

candisc - Visualizing Generalized Canonical Discriminant and Canonical Correlation Analysis

Functions for computing and visualizing generalized canonical discriminant analyses and canonical correlation analysis for a multivariate linear model. Traditional canonical discriminant analysis is restricted to a one-way 'MANOVA' design and is equivalent to canonical correlation analysis between a set of quantitative response variables and a set of dummy variables coded from the factor variable. The 'candisc' package generalizes this to higher-way 'MANOVA' designs for all factors in a multivariate linear model, computing canonical scores and vectors for each term. The graphic functions provide low-rank (1D, 2D, 3D) visualizations of terms in an 'mlm' via the 'plot.candisc' and 'heplot.candisc' methods. Related plots are now provided for canonical correlation analysis when all predictors are quantitative. Methods for linear discriminant analysis are now included.

dimension-reduction, multivariate-linear-models, visualization

10.09 score 16 stars 3 dependents 334 scripts 6.3k downloads
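
The equivalence stated in the candisc description, between one-way canonical discriminant analysis and canonical correlation analysis on dummy variables, can be illustrated with base R alone. The sketch below is not the candisc API; it uses stats::cancor() on the built-in iris data, which is an arbitrary choice made here for illustration.

```r
# Responses: the four iris measurements
Y <- as.matrix(iris[, 1:4])

# Predictors: dummy variables coded from the factor Species
# (the intercept column is dropped, leaving 2 dummies for 3 groups)
D <- model.matrix(~ Species, data = iris)[, -1]

# Canonical correlation analysis between the two sets
cc <- cancor(Y, D)

# One canonical correlation per discriminant dimension:
# min(4 responses, 2 dummies) = 2 dimensions
cc$cor

# Squared canonical correlations: share of variance in each
# canonical variate that is accounted for by group membership
cc$cor^2
```

With three groups there are at most two discriminant dimensions, which is why a 2D display can show all of the between-group variation in this example.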

HistData - Data Sets from the History of Statistics and Data Visualization

The 'HistData' package provides a collection of small data sets that are interesting and important in the history of statistics and data visualization. The goal of the package is to make these available, both for instructional use and for historical research. Some of these present interesting challenges for graphics or analysis in R.

graphics, historical-data

9.16 score 69 stars 2 dependents 1.1k scripts 3.2k downloads

nestedLogit - Nested Dichotomy Logistic Regression Models

Provides functions for specifying and fitting nested dichotomy logistic regression models for a multi-category response and methods for summarising and plotting those models. Nested dichotomies are statistically independent, and hence provide an additive decomposition of tests for the overall 'polytomous' response. When the dichotomies make sense substantively, this method can be a simpler alternative to the standard 'multinomial' logistic model which compares response categories to a reference level. See: J. Fox (2016), "Applied Regression Analysis and Generalized Linear Models", 3rd Ed., ISBN 1452205663.

logistic-regression, multinomial-logistic-regression, polytomous-variables

7.99 score 10 stars 57 scripts 8.2k downloads
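
The idea behind nested dichotomies, as described in the nestedLogit entry, can be sketched with two ordinary binomial glm() fits in base R. This does not use the nestedLogit functions. The iris data, the choice of dichotomies, and the single predictor are all assumptions made here for illustration, not a substantively meaningful model.

```r
dat <- iris

# Dichotomy 1: setosa vs. {versicolor, virginica}, using all 150 flowers
dat$is_setosa <- dat$Species == "setosa"
d1 <- glm(is_setosa ~ Sepal.Width, family = binomial, data = dat)

# Dichotomy 2: versicolor vs. virginica, fitted only within the
# non-setosa subset, so it is nested inside the second branch above
others <- droplevels(subset(dat, Species != "setosa"))
d2 <- glm(Species ~ Sepal.Width, family = binomial, data = others)

# Likelihood-ratio chi-square for the predictor in each dichotomy
lr1 <- d1$null.deviance - d1$deviance
lr2 <- d2$null.deviance - d2$deviance

# Additive decomposition: the separate statistics sum to a test
# for the three-category response as a whole (1 df + 1 df = 2 df)
lr_total <- lr1 + lr2
c(dichotomy1 = lr1, dichotomy2 = lr2, total = lr_total)
pchisq(lr_total, df = 2, lower.tail = FALSE)
```

Each observation contributes to the second dichotomy only if it falls in the branch being split, which is what makes the two fits independent and their test statistics additive.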

ggbiplot - A Grammar of Graphics Implementation of Biplots

A 'ggplot2'-based implementation of biplots, giving a representation of a dataset in a two-dimensional space accounting for the greatest variance, together with variable vectors showing how the data variables relate to this space. It provides a replacement for stats::biplot(), but with many enhancements to control the analysis and graphical display. It implements biplot and scree plot methods which can be used with the results of prcomp(), princomp(), FactoMineR::PCA(), ade4::dudi.pca() or MASS::lda() and can be customized using 'ggplot2' techniques.

biplot, data-visualization, dimension-reduction, principal-component-analysis

7.83 score 15 stars 1 dependents 2.7k scripts 2.8k downloads
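
For context, the base-R workflow that the ggbiplot entry says it replaces looks roughly like the sketch below: a prcomp() fit followed by stats::biplot(). The iris data and the decision to standardize the variables are illustrative choices made here, and the code uses only base R.

```r
# Principal component analysis of the four iris measurements,
# standardized so that each variable has unit variance
pca <- prcomp(iris[, 1:4], scale. = TRUE)

# Proportion of variance accounted for by each component
summary(pca)

# Base-R biplot: observations as points, variables as vectors,
# in the plane of the first two components
biplot(pca)

# A prcomp object like 'pca' is one of the result types the
# description lists as accepted by the ggbiplot methods.
```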

Guerry - Maps, Data and Methods Related to Guerry (1833) "Moral Statistics of France"

Contains maps of France in 1830 and multivariate datasets from A.-M. Guerry and others. Statistical and graphic methods related to Guerry's "Moral Statistics of France" are used to understand Guerry's data and illustrate methods. The goal is to facilitate the exploration and development of statistical and graphic methods for multivariate data in a geospatial context of historical interest.

france, moral-statistics, multivariate-spatial-analysis

5.81 score 2 stars 59 scripts 5.5k downloads

mvinfluence - Influence Measures and Diagnostic Plots for Multivariate Linear Models

Computes regression deletion diagnostics for multivariate linear models and provides some associated diagnostic plots. The diagnostic measures include hat-values (leverages), generalized Cook's distance, and generalized squared 'studentized' residuals. Several types of plots to detect influential observations are provided.

multivariate-analysis, multivariate-linear-regression, statistics, visualization

5.38 score 2 stars 37 scripts 3.2k downloads

colorize - Render Text in Color for Markdown/Quarto Documents

Provides some simple functions for printing text in color in 'markdown' or 'Quarto' documents, to be rendered as HTML or LaTeX. This is useful when writing about the use of colors in graphs or tables, where you want to print their names in their actual color to give a direct impression of the color, like “red” shown in red, or “blue” shown in blue.

quarto

4.70 score 2 stars 5 scripts 201 downloads

CASIdata - Datasets from Computer Age Statistical Inference

Provides the datasets from Efron & Hastie (2016, ISBN: 9781108107952), "Computer Age Statistical Inference: Algorithms, Evidence, and Data Science", in an accessible R format for those who want to use them for study or to try to reproduce analyses from the book.

4.30 score 1 stars 5 scripts 172 downloads

genridge - Generalized Ridge Trace Plots for Ridge Regression

The genridge package introduces generalizations of the standard univariate ridge trace plot used in ridge regression and related methods. These graphical methods show both bias (actually, shrinkage) and precision, by plotting the covariance ellipsoids of the estimated coefficients, rather than just the estimates themselves. 2D and 3D plotting methods are provided, both in the space of the predictor variables and in the transformed space of the PCA/SVD of the predictors.

bias-variance, graphics, principal-component-analysis, regression-models, ridge-regression, singular-value-decomposition

4.28 score 4 stars 95 scripts 267 downloads

ggCheysson - Graphic Styles of Emile Cheysson for 'ggplot2'

Implements for 'ggplot2' the stylistic elements (fonts, hatched patterns, color palettes) used by 'Emile Cheysson' in the 'Albums de Statistique Graphique', sometimes called the pinnacle of the Golden Age of Statistical Graphics.

data-visualization, historical

4.08 score 2 stars 20 scripts

twoway - Analysis of Two-Way Tables

Carries out analyses of two-way tables with one observation per cell, together with graphical displays for an additive fit and a diagnostic plot for removable 'non-additivity' via a power transformation of the response. It implements methods from Tukey's Exploratory Data Analysis (1977) <ISBN: 978-0201076165>, including a 1-degree-of-freedom test for row*column 'non-additivity', linear in the row and column effects.

anova, residuals, transformations, tukey

3.73 score 4 stars 27 scripts 201 downloads
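
The 1-degree-of-freedom test for non-additivity mentioned in the twoway entry can be sketched in base R without the package. The example below assumes the built-in VADeaths table (one death rate per age-by-group cell) as illustrative data. It uses the squared fitted values of the additive model as the extra regressor; once row and column effects are in the model, this is equivalent to the product of the row and column effects.

```r
# Two-way table with one observation per cell, in long format
tab <- as.data.frame(as.table(VADeaths))
names(tab) <- c("age", "group", "rate")

# Additive fit: rate = grand mean + age effect + group effect
additive <- lm(rate ~ age + group, data = tab)

# Squaring the additive fit yields terms in age alone and group alone
# (absorbed by the main effects) plus the product of the two effects,
# which is the single non-additivity term being tested
tab$comparison <- fitted(additive)^2
tukey <- lm(rate ~ age + group + comparison, data = tab)

# F test with 1 numerator degree of freedom for non-additivity
anova(additive, tukey)
```

A small p-value in the second row of the table suggests an interaction of this product form, which is the kind that a power transformation of the response may remove.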

VisCollin - Visualizing Collinearity Diagnostics

Provides methods to calculate diagnostics for multicollinearity among predictors in a linear or generalized linear model. It also provides methods to visualize those diagnostics following Friendly & Kwan (2009), "Where’s Waldo: Visualizing Collinearity Diagnostics", <doi:10.1198/tast.2009.0012>. These include better tabular presentation of collinearity diagnostics that highlight the important numbers, a semi-graphic tableplot of the diagnostics to make warning and danger levels more salient, and a "collinearity biplot" of the smallest dimensions of predictor space, where collinearity is most apparent.

biplots, collinearity-diagnostics, graphics, regression-models

3.60 score 1 stars 20 scripts 223 downloads
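
The kind of collinearity diagnostics the VisCollin entry refers to, condition indices and variance-decomposition proportions, can be computed by hand in base R. The sketch below follows the standard Belsley-style recipe and is not necessarily how the package computes or scales them. The mtcars model is an arbitrary example chosen here.

```r
# Model matrix (including the intercept) for an illustrative regression
X <- model.matrix(mpg ~ cyl + disp + hp + wt, data = mtcars)

# Scale each column to unit length before the decomposition
Xs <- sweep(X, 2, sqrt(colSums(X^2)), "/")
sv <- svd(Xs)

# Condition indices: largest singular value divided by each one;
# large values flag near-dependencies among the columns
cond_index <- sv$d[1] / sv$d

# Variance-decomposition proportions: the share of each coefficient's
# variance associated with each singular-value dimension
phi <- sweep(sv$v^2, 2, sv$d^2, "/")
props <- t(phi / rowSums(phi))
dimnames(props) <- list(paste0("dim", seq_along(sv$d)), colnames(X))

# Rows are dimensions (smallest singular value last); each column sums to 1
round(cbind(cond_index, props), 2)
```

The rows with the largest condition indices correspond to the smallest dimensions of predictor space, which is where the description says collinearity is most apparent.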

WordPools - Word Pools Used in Studies of Learning and Memory

Collects several classical word pools used most often to provide lists of words in psychological studies of learning and memory. It provides a simple function, 'pickList' for selecting random samples of words within given ranges.

experiment, memory, wordlist-generator

3.18 score 3 stars 8 scripts 188 downloads

gellipsoid - Generalized Ellipsoids

Represents generalized geometric ellipsoids with the "(U,D)" representation. It allows degenerate and/or unbounded ellipsoids, together with methods for linear and duality transformations, and for plotting. Thus ellipsoids are naturally extended to include lines, hyperplanes, points, cylinders, etc. This permits exploration of a variety of statistical issues that can be visualized using ellipsoids, as discussed by Friendly, Monette & Fox (2013), "Elliptical Insights: Understanding Statistical Methods Through Elliptical Geometry" <doi:10.1214/12-STS402>.

3d-graphics, ellipse, geometry, matrix

2.70 score 5 scripts 212 downloads