Become an expert in R — Interactive courses, Cheat Sheets, certificates and more!
Get Started for Free

collapse-documentation

Collapse Documentation & Overview


Description

The following table fully summarizes the contents of collapse. The documentation is structured hierarchically: This is the main overview page, linking to topical overview pages and associated function pages (unless functions are documented on the topic page).

Topics and Functions

Topic Main Features / Keywords Functions
Fast Statistical Functions Fast (grouped and weighted) statistical functions for vector, matrix, data frame and grouped data frames (class 'grouped_df', dplyr compatible). fsum, fprod, fmean, fmedian, fmode, fvar, fsd, fmin, fmax, fnth, ffirst, flast, fNobs, fNdistinct
Fast Grouping and Ordering Fast (ordered) groupings from vectors, data frames, lists. 'GRP' objects are extremely efficient inputs for programming with collapse's fast functions. fgroup_by can attach them to a data frame, for fast dplyr-style grouped computations. In addition fast radix-sort based ordering, unique values/rows, factor generation, vector grouping, interactions, generalized run-length type grouping and grouping of time-sequences. GRP, as.factor_GRP, GRPnames, is.GRP, fgroup_by, fgroup_vars, fungroup, radixorder(v), funique, qF, qG, is.qG, fdroplevels, finteraction, groupid, seqid
Fast Data Manipulation Fast and flexible select, subset, summarise, transform, sort/reorder and rename data, including modifying/adding columns by reference, automated replacing/adding with lists of transformed columns, and computing columns saved as a new dataset. In addition a set of (standard evaluation) functions for fast selecting, replacing or adding data frame columns, including shortcuts to select and replace variables by data type. fselect(<-), fsubset/ss, fsummarise, (f/set)transform(v)(<-), fcompute, roworder(v), colorder(v), (f/set)rename, get_vars(<-), add_vars(<-), num_vars(<-), cat_vars(<-), char_vars(<-), fact_vars(<-), logi_vars(<-), Date_vars(<-)
Quick Data Conversion Quick conversions: data.frame <> data.table <> tibble | matrix <> list, data.frame, data.table (row- or column- wise), tibble | array > matrix, data.frame, data.table, tibble | list > data.frame, data.table, tibble | vector > factor, matrix, data.frame, data.table, tibble; and converting factors / all factor columns. qDF, qDT, qTBL, qM, qF, mrtl, mctl, as.numeric_factor, as.character_factor
Advanced Data Aggregation Fast and easy (weighted and parallelized) aggregation of multi-type data, with (multiple) functions applied to numeric and categorical columns. Also supports fully customized aggregation tasks mapping functions to columns. collap(v/g)
Data Transformations Fast row- and column- arithmetic and (object preserving) apply functionality for matrices and data frames. Fast (grouped) replacing and sweeping of statistics and (grouped and weighted) scaling / standardizing, (higher-dimensional) within- and between-transformations (i.e. centering and averaging), linear prediction and partialling out. Additional methods for grouped_df (dplyr) and pseries, pdata.frame (plm). %(r/c)r%, %(r/c)(+/-/*//)%, dapply, BY, TRA, fscale/STD, fbetween/B, fwithin/W, fHDbetween/HDB, fHDwithin/HDW
Linear Models Fast (weighted) linear model fitting with 6 different solvers and a fast F-test to test exclusion restrictions on linear models with (large) factors. flm, fFtest
Time Series and Panel Series Fast (sequences of) lags / leads and (lagged / leaded and iterated) differences, quasi-differences, (quasi-) log-differences and (compounded) growth rates on (unordered, irregular) time series and panel data. Panel data to (ts-)array conversions. Multivariate panel- auto-, partial- and cross-correlation functions. Additional methods for grouped_df (dplyr) and pseries, pdata.frame (plm). flag/L/F, fdiff/D/Dlog, fgrowth/G, psmat, psacf, pspacf, psccf
List Processing (Recursive) list search and identification, search and extract list-elements / list-subsetting, splitting, list-transpose, apply functions to lists of data frames / data objects, and (fast) generalized recursive row-binding / unlisting in 2-dimensions / to data frame. is.unlistable, ldepth, has_elem, get_elem, atomic_elem(<-), list_elem(<-), reg_elem, irreg_elem, rsplit, t_list, rapply2d, unlist2d
Summary Statistics Fast (grouped and weighted), summary statistics for cross-sectional and complex multilevel / panel data. Efficient detailed description of data frame. Fast check of variation in data (within groups / dimensions). (Weighted) pairwise correlations and covariances (with observation count, p-value and pretty printing), pairwise observation count. Some additional methods for grouped_df (dplyr) pseries and pdata.frame (plm). qsu, descr, varying, pwcor, pwcov, pwNobs
Recode and Replace Values Recode multiple values (exact or regex matching) and replace NaN/Inf/-Inf and outliers (according to 1- or 2-sided threshold or column standard-deviation) in vectors, matrices or data frames. recode_num, recode_char, replace_NA, replace_Inf, replace_outliers
Small (Helper) Functions Fast missing value detection, insertion and removal, faster nlevels for factors, fast nrow, ncol, dim (for data frames) and seq_along rows or columns, non-standard concatenation, set and extract variable labels, extract variable classes and C storage types, display variable names, labels and classes, add / remove prefix or postfix to / from column names, not-in operator, matching with error message for non-matched, check exact or near / numeric equality of multiple objects or of all elements in a list, return object with dimnames, row- or colnames efficiently set, or with all attributes removed, C-level functions to set and duplicate / copy attributes, identify categorical and date(-time) objects, Choleski (fast) inverse of symmetric PD matrix. allNA, missing_cases, na_insert, na_rm, na_omit, fnlevels, fnrow, fncol, fdim, seq_row, seq_col, .c, vlabels(<-), vclasses, vtypes, namlab, add_stub, rm_stub, %!in%, ckmatch, all_identical, all_obj_equal, setDimnames, setRownames, setColnames, unattrib, setAttrib, copyAttrib, copyMostAttrib, is.categorical, is.Date, cinv
Data and Global Macros Groningen Growth and Development Centre 10-Sector Database, World Bank World Development dataset, and some global macros containing links to the topical documentation pages (including this page), all exported objects (excluding exported S3 methods), all generic functions, the 2 datasets, all fast functions, all fast statistical (scalar-valued) functions, and all transformation operators / operator-like functions. GGDC10S, wlddev, .COLLAPSE_TOPICS, .COLLAPSE_ALL, .COLLAPSE_GENERIC, .COLLAPSE_DATA, .FAST_FUN, .FAST_STAT_FUN, .OPERATOR_FUN
Package Options Set the action taken by generic functions encountering unknown arguments. The default is "warning". Other choices are "error", "message" or "none", where "none" enables silent swallowing. options(collapse_ unused_arg_action)

Details

The added top-level documentation infrastructure in collapse allows you to effectively navigate the package. Calling ?FUN brings up the documentation page documenting the function, which contains links to associated topic pages and closely related functions. You can also call topical documentation pages directly from the console. The links to these pages are contained in the global macro .COLLAPSE_TOPICS (e.g. calling help(.COLLAPSE_TOPICS[1]) brings up this page).

Author(s)

Maintainer: Sebastian Krantz sebastian.krantz@graduateinstitute.ch

See Also


collapse

Advanced and Fast Data Transformation

v1.5.3
GPL (>= 2) | file LICENSE
Authors
Sebastian Krantz [aut, cre], Matt Dowle [ctb], Arun Srinivasan [ctb], Laurent Berge [ctb], Dirk Eddelbuettel [ctb], Josh Pasek [ctb], Kevin Tappe [ctb], R Core Team and contributors worldwide [ctb], Martyn Plummer [cph], 1999-2016 The R Core Team [cph]
Initial release
2021-03-05

We don't support your browser anymore

Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.