R package built to automate, improve, and speed everyday
Analysis and Machine Learning tasks. With a wide
variety of family functions like Machine Learning, data cleaning and
processing, EDA, Investment, NLP, Queries, Scrappers, API interactions,
lares
helps the analyst or data scientist get quick,
reproducible, robust results, without the need of repetitive coding nor
extensive programming skills. Feel free to install, use, and/or comment
on any of the code and functionalities. And if you are also colourblind,
be sure to check the colour palettes!
Don’t hesitate to contact me, and please when you do, let me know where did you first hear from the library and which family of functions you are most interested in.
## CRAN VERSION
install.packages("lares")
## DEV VERSION
# If you don't have remotes yet, run: install.packages('remotes')
remotes::install_github("laresbernardo/lares")
# Full installation with recommended dependencies (takes more time)
remotes::install_github("laresbernardo/lares", dependencies = TRUE)
Windows users: you MAY have to install
RTools
before installing the dev version. Download it here.
Vignette: Introduction
to AutoML using lares
Medium: Select the right MMM candidate based on your specific criteria and business knowledge
DataScience+: Visualizations for Classification Models Results
DataScience+: Visualizations for Regression Models Results
DataScience+: AutoML and DALEX for Dataset Understanding
DataScience+: Find Insights with Ranked Cross-Correlations
DataScience+: Portfolio’s Performance and Reporting
DataScience+: Manage Credentials Safely in R
DataScience+: Using cache to avoid re-processing in R
DataScience+: Best/Worst starting words for Wordle using simulations
DataScience+: Plot Timelines with Gantt Charts
h2o_automl()
To get insights and value out of your dataset, first you need to
understand its structure, types of data, empty values, interactions
between variables… corr_cross()
and freqs()
are here to give you just that! They show a wide perspective of your
dataset content, correlations, and frequencies. Additionally, with the
missingness()
function to detect all missing values and
df_str()
to break down you data frame’s structure, you will
be ready to squeeze valuable insights out of your data. If you’d like to
go deeper, try the x2y()
, lasso_vars()
and
tree_var()
to use Machine Learning to detect patterns,
predictive powers, and variables importance.
My favourite and most used functions are freqs()
,
distr()
, and corr_var()
. In this RMarkdown you
can see them in action. Basically, they group and count values within
variables, show distributions of one variable vs another one (numerical
or categorical), and calculate/plot correlations of one variables vs all
others, no matter what type of data you insert.
If there is space for one more, I would add ohse()
(One
Hot Smart Encoding), which has made my life much easier and my work much
valuable. It converts a whole data frame into numerical values by making
dummy variables (categoricals turned into new columns with 1s and 0s,
ordered by frequencies and grouping less frequent into a single column)
and dates into new features (such as month, year, week of the year,
minutes if time is present, holidays given a country, currency exchange
rates, etc).
You can check all active functions and documentations here
or type lares::
in RStudio and you will
get a pop-up with all the functions that are currently available within
the package. You might also want to check the whole documentation by
running help(package = "lares")
in your RStudio or in the
Online
Official Documentation. Remember to check the families and similar
functions on the See Also sections as well.
If you need help with any of the functions when using RStudio, use
the ?
function (i.e. ?lares::function
) and the
Help tab will display a short explanation on each
function and its parameters. You might also be interested in the online
documentation to check all functions and parameters.
If you encounter a bug, please share with me a reproducible example on Github issues and I’ll take care of it. For inquiries, and other matters, you can contact me on LinkedIn anytime!