Masthash

#RStats

CRAN Package Updates
8 minutes ago

CRAN updates: bslib cequre fastAFT gmvarkit gpboost skilljaR #rstats

CRAN Package Updates
8 minutes ago

New CRAN package robotoolbox with initial version 1.3.2
#rstats
https://cran.r-project.org/package=robotoolbox

CRAN Package Updates
8 minutes ago

New CRAN package fritools2 with initial version 4.1.0
#rstats
https://cran.r-project.org/package=fritools2

CRAN Package Updates
8 minutes ago

New CRAN package cereal with initial version 0.1.0
#rstats
https://cran.r-project.org/package=cereal

Sharon Machlis
2 hours ago

@milesmcbain There’s also @brodriguesco 's Workshop for Ukraine on reproducible pipelines in R on June 29 that includes using {targets}, as well as {renv} and Docker #rstats

https://sites.google.com/view/dariia-mykhailyshyna/main/r-workshops-for-ukraine#h.i3fjt5lw8dyo

CRAN Package Updates
2 hours ago

CRAN updates: GJRM #rstats

In case this is useful to anyone else, here's an #rstats function I keep needing:

#' List to Text
#'
#' Convert a list or vector to text with human-readable separators, e.g., "A, B & C".
#'
#' @param x The list or vector to convert
#' @param comma The text to use to separate all but the last item
#' @param and The text to use to separate the last item
#'
#' @return A character string
#' @export
#'
#' @examples
#' comma_and(LETTERS[1:5])
#' comma_and(LETTERS[1:5], and = " and ")
#' comma_and(LETTERS[1:5], comma = "; ")
#'
#' # change and to use an oxford comma
#' my_list <- list("Nelson Mandela",
#' "an 800-year-old demigod",
#' "a dildo collector")
#' comma_and(my_list) # probably not what you mean
#' comma_and(my_list, and = ", and ")

comma_and <- function(x, comma = ", ", and = " & ") {
if (length(x) == 1) {
txt <- x
} else {
last <- x[length(x)]
first <- paste(x[1:(length(x)-1)], collapse = comma)
txt <- paste0(first, and, last)
}

return(txt)
}

CRAN Package Updates
4 hours ago

CRAN updates: survstan #rstats

CRAN Package Updates
4 hours ago

New CRAN package GGoutlieR with initial version 1.0.0
#rstats
https://cran.r-project.org/package=GGoutlieR

Datascience :verified:
4 hours ago

Customize what happens when you start R: https://henrikbengtsson.github.io/startup/ #rstats #environment

CRAN Package Updates
5 hours ago

CRAN updates: kohonen saeHB.ME.beta #rstats

CRAN Package Updates
5 hours ago

New CRAN package FuzzySimRes with initial version 0.1.2
#rstats
https://cran.r-project.org/package=FuzzySimRes

CRAN Package Updates
5 hours ago

New CRAN package epiworldR with initial version 0.0-1
#rstats
https://cran.r-project.org/package=epiworldR

CRAN Package Updates
5 hours ago

New CRAN package cvmaPLFAM with initial version 0.1.0
#rstats
https://cran.r-project.org/package=cvmaPLFAM

CRAN Package Updates
6 hours ago

CRAN updates: SIRthresholded #rstats

coolbutuseless
7 hours ago

I forgot to post the hex logo for {carelesswhisper} - so here it is.

Andrew and George.

#RStats #GuiltyFeetHaveGotNoRhythm

Mark van der Loo
7 hours ago

New #rstats package tinytest2JUnit exports tinytest results for integration with CI/CD workflow. Thanks to @openanalytics 👍

https://cran.r-project.org/web/packages/tinytest2JUnit/

CRAN Package Updates
7 hours ago

CRAN removals: BMRSr GapAnalysis multilevelcoda RMixtComp spatPomp #rstats

Gönczi Pál
7 hours ago

My mood today. Even #rstats can't help.

aRtsy_package
8 hours ago

Today's artwork generated with #rstats and #ggplot2:

CRAN Package Updates
8 hours ago

CRAN updates: microeco #rstats

CRAN Package Updates
9 hours ago

CRAN updates: lrstat #rstats

CRAN Package Updates
10 hours ago

CRAN updates: pm3 RESI snvecR #rstats

MilesMcBain
10 hours ago

In all likelihood what they forgot, and continue to forget, to teach you about R is {targets}.

But am I heartened that there are 2 talks on the bill for #positconf2023 citing the most impactful #rstats tool since ggplot! Bravo.

coolbutuseless
12 hours ago

The {careless whisper} R pkg ships with just the tiniest model. This is fast, and good for proof-of-concept speech-to-text.

If you're interested in good speech-to-text, then please download & try one of the larger whisper models.

They range up to 4GB in size, but even some of the medium sized models are very good.

Notes:
* If you're only interested in the English language, then use one of the "english only" models.
* Larger models are slower!

Models: https://ggml.ggerganov.com/

#RStats

coolbutuseless
12 hours ago

@rmflight I'm looking forward to see what you've done!

I'll look into the streaming interface at some stage and figure out whether it could be nicely supported in #RStats

joranelias
12 hours ago

Am I the only #rstats gt user that constantly wants the location rows selectors to operate as expressions that are evaluated on each selected column independently, ie ‘~ .x < 50’ to select cells less than 50 in each column separately?

rstatspkgbot
13 hours ago

📦 tidyfst
📝 Tidy Verbs for Fast Data Manipulation

🔗 https://cran.r-project.org/web/packages/tidyfst/index.html

🤖#RStats

CRAN Package Updates
14 hours ago

CRAN updates: boinet grates #rstats

jeremy_data :rstats:
14 hours ago

While sin is a made-up concept, the thing Christians say about sin is 100% true when applied to working with dates in #datascience:

"Sin will take you farther than you want to go, keep you longer than you want to stay, and cost you more than you want to pay."

#rstats #data #datadon #KillingMeSoftlyWithDates

CRAN Package Updates
15 hours ago

CRAN updates: admiral api2lm BFS crossval PaRe RcppJagger #rstats

CRAN Package Updates
17 hours ago

CRAN updates: altdoc #rstats

coolbutuseless
18 hours ago

@rmflight miniaudio has facilities for WAV and mp3 decoding, so it's conceivable that you could do something like this.

ctx <- whisper_init()
whisper_stream(ctx, "sound_file.mp3")

And this would internally decode/tranlsate on the fly (without decoding the audio into a single gigantic sound file)

#RStats

CRAN Package Updates
18 hours ago

CRAN updates: historicalborrowlong #rstats

Cédric Scherer
19 hours ago

📢 I am going to teach a new two-part workshop series in collaboration with Pearson × OReilly Media:

"Hands-On Data Visualization with ggplot2" 📈👩‍💻🧑‍💻

Follow the first session "Concepts" next Tuesday
📅 June 13, 12pm-3pm EST

👉 https://www.oreilly.com/live-events/hands-on-data-visualization-with-ggplot2-concepts/0636920089879/

#rstats #tidyverse #ggplot2 #dataviz #datavis #datavisualization

CRAN Package Updates
19 hours ago

CRAN updates: gifski #rstats

CRAN Package Updates
19 hours ago

New CRAN package texor with initial version 1.0.1
#rstats
https://cran.r-project.org/package=texor

CRAN Package Updates
19 hours ago

New CRAN package ProFAST with initial version 1.2
#rstats
https://cran.r-project.org/package=ProFAST

Elio Campitelli
19 hours ago

Because procrastination and because knitr makes it dead easy (yay #RStats!), I rendered a gitbook version of my second paper:

https://eliocamp.github.io/publication/shceof/gitbook/abstract.html

briandconnelly
1 day ago

Now on CRAN! {ami} (“am I?”) is a unified collection of lightweight checks that can be used to better understand the environments in which your #rstats code is running.

Physalia-courses
1 day ago

📢Calling all biologists and ecologists!
Join our Introduction to Statistics in R course with @PhilipLeftwich in July. Unlock the mysteries of R & the world of statistical analysis, gaining confidence in your skills!! https://physalia-courses.org/courses-workshops/course13/ 📊
#Statistics #Rstats #DataScience

Sharon Machlis
1 day ago

The {airnow} #rstats 📦 lets you query & retrieve air quality information from the US government's AirNow API. “Current and historical readings as well as forecasts can be retrieved as tidy data frames.” By @briandconnelly . On CRAN.

https://briandconnelly.github.io/airnow/

Of special interest to those of us in the Northeast US at the moment!
#AirQuality #WildFires @rstats

Find the site with the lowest air quality near Washington state
library(airnow)
library(dplyr)

get_airnow_area(
  box = c(-125.394211, 45.295897, -116.736984, 49.172497),
  verbose = TRUE
) |>
  slice_max(order_by = aqi, n = 1) |>
  select(site_name, site_agency, latitude, longitude, aqi, datetime_observed)
#> # A tibble: 1 × 6
#>   site_name         site_agency        latit…¹ longi…²   aqi datetime_observed  
#>   <fct>             <fct>                <dbl>   <dbl> <int> <dttm>             
#> 1 Clarkston-13th St Washington Depart…    46.4   -117.    56 2022-10-31 13:00:00
#> # … with abbreviated variable names ¹​latitude, ²​longitude
Examples
Current air quality in Seattle
The AirNow API allows you to query air conditions either by ZIP code or latitude/longitude. Here, we’ll get the current conditions in Seattle by ZIP code:

library(airnow)

get_airnow_conditions(zip = "98101")
#> # A tibble: 2 × 11
#>   date_observed hour_obs…¹ local…² repor…³ state…⁴ latit…⁵ longi…⁶ param…⁷   aqi
#>   <date>             <int> <fct>   <fct>   <fct>     <dbl>   <dbl> <fct>   <int>
#> 1 2022-10-31             6 PST     Seattl… WA         47.6   -122. O3         27
#> 2 2022-10-31             6 PST     Seattl… WA         47.6   -122. PM2.5      30
#> # … with 2 more variables: category_number <int>, category_name <fct>, and
#> #   abbreviated variable names ¹​hour_observed, ²​local_time_zone,
#> #   ³​reporting_area, ⁴​state_code, ⁵​latitude, ⁶​longitude, ⁷​parameter
coolbutuseless
1 day ago

Note: I've only tested on macOS, and would be very interested to hear if this works on other platforms.

#RStats

coolbutuseless
1 day ago

Introducing {carelesswhisper} - automatic speech recognition in #RStats using whisper.cpp

Attached vid: Live capture of my R session: recording 2 seconds of audio and translating to text.

This app includes the smallest (70MB) multi-language model. It can translate other languages to English too!

Also includes built-in audio recording code based upon miniaudio.

Pkg should work out-of-the box. No dependencies. (in theory!)

https://github.com/coolbutuseless/carelesswhisper

#RStats #GuiltyFeetHaveGotNoRhythm

Live capture of R session showing automatic speech recognition with carelesswhisper package
Hannah Frick
1 day ago

When you check in on your #rstats package again and the code looks like an unwanted time capsule? Do some spring cleaning! The tidy team does it every year and because not everyone enjoys it as much as I do, we have a few things to make it more fun: @andyteucher wrote it all up, from dedicated time to helpful checklists (usethis::use_upkeep_issue() 😍 ) to tongue-in-cheek celebratory certificates at the end.
https://www.tidyverse.org/blog/2023/06/spring-cleaning-2023/

Daniel
1 day ago

I drafted a new vignette on "intersectionality analysis", using the MAIHDA framework with mixed models and {ggeffects}. The vignette not only shows how to quantify variation (inequalities), but also how to compare different groups at risk
https://strengejacke.github.io/ggeffects/articles/practical_intersectionality.html
Intersectionality analysis is a new approach in social epidemiology, which attempts to move away from looking at relevant social indicators in isolation and rather looks at effects of belonging to specific strata simultaneously. #rstats

aRtsy_package
1 day ago

Today's artwork generated with #rstats and #ggplot2:

R-Podcast (Eric) :pci:
1 day ago

For the first time in what feels like forever, I will live-stream @rstats Shiny app development as part of R/Medicine 2023! I'll create a brand-new #shiny app to interactively explore a licorice and gargling clinical study, powered by:

🦏 {rhino} framework @appsilon
🔍 Drill-down summary tables using {Tplyr} and {reactable}
⭐️ New dashboard capabilities from {bslib} @cpsievert @Posit

Come by and say hello!
📆 June 8th 1 PM EST / 7 PM CEST
📡 https://twitch.tv/rpodcast

#RStats #shiny #RMed23

T. Austin Brown
2 days ago

Was just playing around with my new program and made this chart. I still have a lot of bugs to work out before this is a functional part of my dissertation, but I still think it's pretty cool. The alt-text has more description than I can fit here. #rstats #ESG #finance #academia #law

Title: Number of ESG-related words per edgar filing.
Plot shows a number of company names on the x axis and the number of ESG words on the y axis. there pink represents filings in edgar for 2021, teal represents filings for 2022. circles represent 10-k filings, triangles represent 10-Q filings. Exelon corp, an energy company, has by far the greatest number of ESG words (not surprising, because energy companies have serious eco impact, and need more words to describe). The least number is with Apple, Inc. which sounds reasonable, when you think about them being a tech company, but on further reflection, Apple has a lot of supply chain and planned obsolescence problems, so they could really do better with their treatment and disclosures to shareholders--especially given that pollution risk can really come back to bite a company.
posit::glimpse()
2 days ago

Code review is an important part of software development!

@davis shares how #tidyverse made our process explicit in a code review principles guide.

Feel free to modify these principles for your own needs & we’d love to hear about it if you do!

https://www.tidyverse.org/blog/2023/06/code-review-principles/

#rstats #python

posit::glimpse()
2 days ago

Spring Cleaning applies not only to our homes, but also to the code that we maintain. 🌸

@andyteucher shares how the #tidyverse team tackles this together & shows a new feature in {usethis} that will help you organize your own Spring Cleaning!

https://www.tidyverse.org/blog/2023/06/spring-cleaning-2023/

#rstats

Matt Dray
2 days ago

Imagine #RStats blogging as a hobby, lol. Anyway, here's two more posts.
👮 Rectangularise Word tables extracted by {officer}: https://www.rostrum.blog/2023/06/07/rectangular-officer/
📊 Recreating a dataviz with #ggplot2: https://www.rostrum.blog/2023/05/10/spear-ggplot2/

The Office meme that says 'corporate needs you to find the difference between these two pictures. A crudely copy-pasted image of an original data visualisation by Mary Eleanor Spear is on the left and a recreation of it is on the right. They look pretty similar.
The 'draw the rest of the owl' meme. The title is 'how to draw an owl' but it's been scribble dout and replaced with comic sans text that says 'How to extract an R data.frame from a Word table'. There are two steps: 'draw some circles' and then 'draw the rest of the owl'. The text for these has been replaced with Comic Sans that reads 'let officer do all the hard work' and then 'overengineer an unecessary new function.'
Julia Silge
2 days ago

Join the SLC RUG right now to hear from @ivelasq3 about data visualization with :rstats:! #rstats

https://www.youtube.com/live/XEDOfUp_O7w

Sharon Machlis
2 days ago

“The {datawizard} #rstats 📦 (from the easystats ecosystem) has two very useful functions to deal with duplicates:

* data_duplicated: Extract all duplicates including the first, unlike duplicated() or dplyr::distinct()

* data_unique: by default selects the ‘best’ duplicate” - Rémi Thériault

https://easystats.github.io/datawizard/reference/data_duplicated.html

https://easystats.github.io/datawizard/reference/data_unique.html

#DataWrangling @rstats

Extract all duplicates
Source: R/data_duplicated.R
Extract all duplicates, for visual inspection. Note that it also contains the first occurrence of future duplicates, unlike duplicated() or dplyr::distinct()). Also contains an additional column reporting the number of missing values for that row, to help in the decision-making when selecting which duplicates to keep.
Keep only one row from all with duplicated IDs
Source: R/data_unique.R
From all rows with at least one duplicated ID, keep only one. Methods for selecting the duplicated row are either the first duplicate, the last duplicate, or the "best" duplicate (default), based on the duplicate with the smallest number of NA. In case of ties, it picks the first duplicate, as it is the one most likely to be valid and authentic, given practice effects.

Contrarily to dplyr::distinct(), data_unique() keeps all columns.

Usage
data_unique(
  data,
  select = NULL,
  keep = "best",
  exclude = NULL,
  ignore_case = FALSE,
  regex = FALSE,
  verbose = TRUE
)
DMONTAGNE
2 days ago

Dear #RStats community,
I'm preparing a presentation on #R and #Wikipedia. Do you know of any other interesting packages besides WikipediR, WikidataR and GlittR?

R-Ladies Rome
2 days ago

📌
@rladiesrome
is hosting two events this month:

1.- Data Science Best Practices
Speaker: Dr. Simina Boca
When: June 12, 2023 at 6.00 PM CET / 12.00 PM EDT
RSVP: https://bit.ly/444pL6X

2.- One Health and the Politics of Coronaviruses
Speaker: Dr. Laura Kahn
When: June 30, 2023 at 5.00 PM CET / 11.00 PM EDT
RSVP: https://bit.ly/43MM3Kg

#datascience #womenempowerment #rstats

https://www.meetup.com/rladies-rome/
Ewan Donnachie
2 days ago

The latest Excel blunder from Austria is a lesson in why we need professional data people.

There's a widespread expectation that anyone can take Excel and use it to do critical things with data.

But data people know that we need the right tools to make data processing verifiable (usually with code) and enable us to check that everything is as it should be (with unit tests or assertions).

And most of all, we need more #dataliteracy at every level.

#RStats

https://www.theregister.com/2023/06/06/austria_election_excel_blunder/

Thomas Lumley
2 days ago

In which I argue that inheritance is backwards for statistical methods objects #rstats
https://notstatschat.rbind.io/2023/06/07/blank-cheque-inheritance-and-statistical-objects/

Sharon Machlis
3 days ago

“The {marginaleffects} 📦 book is now online! 25 chapters on post-estimation analyses and interpretation with #Rstats. The 📖 is full of tutorials, case studies, tips, and technical notes. Please check it out and let us know how we can improve this resource” - @vincentab
https://vincentarelbundock.github.io/marginaleffects/

Sharon Machlis
3 days ago

This GitHub repo has a number of interactive Shiny apps for "self-discovery of statistical concepts and rules-of-thumb." By Devan Becker
https://github.com/DBecker7/DB7_TeachingApps
#rstats #RShiny #statistics @rstats

Tools
pnorm: Calculate and visualize normal probabilities.
pvalues: Calculate and visualize p-values (normal distr).
distrshapes: How parameters affect shape of continuous distributions.
distrshapes_disc: How parameters affect shape ofdiscrete distributions.
SimplePower: Calculate power for simple null/alt (normal distr).
Self-Discovery Apps
PoisBinApprox: Poisson (and normal) approximation to the binomial.
QQDistrFittting: QQ Plots to test distributional assumptions.
ScatterCorr: Scatterplots with a fixed correlation.
InfluentialPoint: Move around a point to see it's affect on the line.
MultipleRegression Penguins: See the effects of including/removing predictors.
SerialCorrelation: Demonstrate autocorrelation and the runs test.
MeanLessMeansLeft: Generate distribution with fixed mean and median.
DensHist: Compare density plots to histograms.
MeasureSpread: Fixed IQR and SD.
PoissonCatQuant: Barplots versus histograms.
ConditionalNormal2: Visualization of conditional distributions.
indep: Visualization of independence.
Screenshot of a Shiny app "Visualization of p-values" with a slider for Zobs value, additional choices, and a graph
Eric Lawton
3 days ago

>Rsearcher finds factors linked with chronic school absenteeism

Is an Rsearcher a researcher who uses #Rstats?
https://phys.org/news/2023-06-rsearcher-factors-linked-chronic-school.html

Rediscovering once again the thing where, if you want to generate a series of identically formatted ggplots with different data subsets you have to wrap the plot in `print()` to write to pdf within the `for()` loop #Rstats

Nicola Rennie
3 days ago

Keeping it simple and minimal with a slope chart for #TidyTuesday this week! Using data from @ourworldindata, I focused in on how coal production per capita has changed in different countries since 1960 🔥

Code: https://github.com/nrennie/tidytuesday/tree/main/2023/2023-06-06

#RStats #R4DS #DataViz

A slope chart with x-axis points at 1960, 1980, 2000, and 2020. The y-axis shows the percentage change in per capita coal production for different countries. Indonesia is highlighted as it shows a dramatic increase. Two other countries show a sharp increase in 2000, then a decrease.
Eamon
3 days ago

#RStats tip: I only recently learned about `Hmisc::smean.sd` (and friends), and their wrappers in `ggplot2::hmisc`, such as `ggplot2::mean_cl_boot()`. I'm still learning new ways to make {ggplot2} do my data munging for me.

Jenny Bryan
3 days ago

📦 usethis v2.2.0 is out! 📦

The theme of this release is "a year of miscellaneous maintenance" 😅 It's also the version most closely tied to the upcoming print publication of R Packages 2e. Finally, we welcome @andyteucher as a new author! #rstats

https://usethis.r-lib.org/news/index.html

Steffi LaZerte
3 days ago

I'm brainstorming for #RStats @rOpenSci Coworking themes to use in the future.

Some examples of things we've already done...

- Start Writing that Package!
- Getting Started with targets!
- Working with New R Users
- Setting Up Continuous Integration
- Checking Data with naniar, visdat, assertr, and skimr!
- Working with Taxonomic Lists

Any suggestions for future themes? Anything you might like to see?

@rstats

rOpenSci
3 days ago

🚀 [Blog post] Meeting the stars of the R-Universe. This month interview: an Open Source Project to Take Care of the Planet.

We learn about the PEcAn project, where they develop open source tools and models for climate change.

✍️ https://ropensci.org/blog/2023/06/06/r-universe-stars-4-en/

#rstats @rstats

Mike Mahoney
3 days ago

From the inbox: How can I get fold assignments from spatialsample?

https://www.mm218.dev/posts/2023-06-06-spatialsample_splits/

#rstats #rspatial #gischat

Eric Scott
3 days ago

Having learned programming mostly in #rstats, I realize that I have a very fuzzy mental model of what "compiling" code even means. Can someone point me to an explanation of what it means to "compile" or "build from source" for semi- or non-experts?

Physalia-courses
3 days ago

Unleash the power of reproducible workflows in R with @eliocamp and @paocorrales ! 🧩 Join our course to master collaboration, version control, and seamless document generation. Empower your data science journey with efficiency and reliability! 🚀💡
https://physalia-courses.org/courses-workshops/r-reproducibility/

#RStats #Reproducibility #DataScience

Vincent Arel-Bundock
3 days ago

The {marginaleffects} 📦 book is now online! 25 chapters on post-estimation analyses and interpretation with #Rstats. The 📖 is full of tutorials, case studies, tips, and technical notes. Please check it out and let us know how we can improve this resource vincentarelbundock.github.io/marginaleffects

Rita Giordano
3 days ago

I'm very pleased to announce the next speaker of the #RLadiesCambridge #dataviz lunch is @tanya_shapiro Tanya, an independent data consultant, will talk about interactive data visualisation with {ggiraph}. Join us on the 23rd of June at 12.30 pm BST/ 7.30 am EDT. #rstats
https://www.meetup.com/rladies-cambridge/events/293991363/

Maëlle Salmon
3 days ago

New blog post! Three useful (to me) #RStats patterns

☑️ utils::modifyList()
☑️ rlang::%||%
☑️ Base R Set Operations

https://masalmon.eu/2023/06/06/basic-patterns/

joranelias
3 days ago

Is it weird that one of the main sources of minor friction I experience when experimenting in #python coming from #rstats is the way it prints/formats objects at the command line. I’m sure some of it familiarity, but I find things slower to parse, visually.

rstatspkgbot
4 days ago

📦 tidyEmoji
📝 Discovers Emoji from Text

🔗 https://cran.r-project.org/web/packages/tidyEmoji/index.html

🤖#RStats

MilesMcBain
4 days ago

Starting to mix #tidymodels with #targets for the first time in #rstats... I'm interested to hear tips from anyone who's trodden this path before.

It seems that the tidymodels idea of making a large specification which is evaluated late in one large computation is kind of at odds with the value targets brings to caching intermediate steps?

Sharon Machlis
4 days ago

“The often-overlooked do.call() #rstats function is a powerful tool that allows you to dynamically call other functions, opening up a world of possibilities for code organization, reusability, and flexibility.” - @stevensanderson

https://www.spsanderson.com/steveondata/posts/2023-06-01/index.html

R-Ladies Rome
4 days ago

💜 @rladiesrome@bird.makeup is hosting:
👉 Data Science Best Practices
Speaker: @siminaboca

When: June 12th, 2023 - 6:00 PM CET / 12:00 PM EDT
RSVP: 🔗 https://www.meetup.com/rladies-rome/events/293609269/

#rstats #datascience #health #rladies #womenempowerment #onlinelearning #opensource

@rladiesglobal

Georgios Karamanis
5 days ago

Verified oldest people for this week's #TidyTuesday. Big thanks to Aryn Toombs for the tutorial on making a beeswarm chart with circle packing https://aryntoombs.github.io/tutorials/beeswarm.html

code: https://github.com/gkaramanis/tidytuesday/tree/master/2023/2023-week_22

#RStats #dataviz

Beeswarm chart showing the top verified oldest men and women. 61 of the top 100 verified oldest women and 51 of the top 100 verified oldest men have the US or Japan as their place of death or residence.
Jakub Nowosad
5 days ago

⚠️ rgdal, rgeos, and maptools won’t be available on CRAN after October 2023. ⚠️

What are the consequences of this change? What do you need to do as a user and as a developer?

Read my blog post at https://geocompx.org/post/2023/rgdal-retirement/

#rstats #rspatial #gischat #geocompx

Sharon Machlis
6 days ago

Video: Intro to Version Control using git and RStudio - presentation by Ryan Johnson, data science advisor at @Posit, at @NHSrCommunity
https://youtu.be/qNMOPWT8jSo

#rstats #git #RStudio @rstats

Title slide: Intro to Version Control (using) git and RStudio
joranelias
1 week ago

I still haven’t seen anything to disprove the best description of R vs Python for data/stats that I’ve seen:

Python is an elegant, well-designed language with a confusing, oddly designed data DSL bolted onto it & R is an elegant, well designed data DSL with a confusing, oddly designed programming language built around it. #rstats

jonthegeek
1 week ago

Is there a web API that you'd love to use with R, if only it weren't so painful? Or perhaps you're using one, but aren't sure if it would make sense as a package. Please let me know here! https://forms.gle/CJz12TzzHkGsnQma9

#RStats

I often see really bad statistical errors in #python analyses that I don't see in #rstats. I don't think it's necessarily an issue with the language as much as it is the population using it. People with a CS degree and little statistics background often use python, and statisticians often use R.

While I know it's a generalization, I can't help but see python as a sort of anti-shibboleth for good statistical analysis.

Maëlle Salmon
1 week ago

Remember @yabellini's plan to translate the whole internet, I mean, @rOpenSci multilingual publishing project? https://ropensci.org/multilingual-publishing/

You might find these #RStats packages useful:

🌐 {babeldown} https://docs.ropensci.org/babeldown/ for translating Markdown-based content via DeepL API;

🌐 {babelquarto} https://docs.ropensci.org/babelquarto/ for rendering multilingual Quarto books.

Feedback welcome! They're still rather experimental but we do use them. 🧪

Andrew Heiss :rstats:
1 week ago

{sf} is an S-tier #rstats package

Andrew Heiss :rstats:
1 week ago

New blog post! We're about to embark on an 18-state, 5,000+ mile road trip (😱), so I figured I'd visualize it with #rstats and {sf}! Here's how to automatically get geocoded location and routing(!) data from OpenStreetMap and make fancy maps with ggplot #dataviz #rspatial https://www.andrewheiss.com/blog/2023/06/01/geocoding-routing-openstreetmap-r/

Map of road trip route with crossed-through states highlighted
Map of road trip showing the distances between each stop
Screenshot of a map of the 5th day of the return trip
Road trip map faceted by direction
datamaps :rickwhoah:
1 week ago

@sharoz @JASPStats

that's not actually the same as a "wide" format, and in any case wide and long are most times perfectly interchangeable (see melt and dcast in #rstats)

in particular, wide format are quite easy at least to the eyes when treating time series data

@MagicTony @Bakeri666 @JASPStats

The software dictates the format, reshapes it as appropriate, and then feeds it into the analysis. For example, in #rstats, the afex library accepts long format data, aggregates any replicates, and runs an anova.

The bigger point is that a tool designed to alleviate the programming requirement of analyses shouldn't require major preprocessing like reshaping data.

Sharon Machlis
1 week ago

Ooo the {gt} #rstats 📦's new interactive option includes resizable columns with the op_interactive(use_resizers = TRUE ) argument.
Only thing missing for me in this early version is regular expression searching! (I do love that in {dt})
Looking forward to trying this:
https://posit.co/blog/new-in-gt-0-9-0-interactive-tables/

towny_tbl |>
  gt() |>
  opt_interactive(
    use_search = TRUE,
    use_filters = TRUE,
    use_resizers = TRUE,
    use_highlight = TRUE,
    use_compact_mode = TRUE,
    use_text_wrapping = FALSE,
    use_page_size_select = TRUE
  )
Table with headline "Population and Density Data", subhead "Arranged from largest to smallest municipality", columns that are sortable and filterable, and cells in the Population and Density columns that are styled with background colors depending on values.

The glossary #rstats package is now on CRAN!

Glossary is a lightweight solution for making glossaries in educational materials written in quarto or R Markdown. This package provides functions to link terms in text to their definitions in an external glossary file, as well as create a glossary table of all linked terms at the end of a section.

https://debruine.github.io/glossary/

Example of the glossary popup, showing the definition of "effect size".