The R Data Scientist logo

The R Data Scientist

Subscribe
Archives
September 23, 2025

Data Scientist (with R)

R news, data engineering, visualisation, spatial data

📰 R community news, events, and Posit updates

Weekly recap (Sep 19, 2025) (blog​.stephenturner​.us). AI-generated genomes, biosafety, mirror life, AI in teaching, CRAN defense, Posit news, and OpenAI ChatGPT usage insights

Generative AI and R workshops in Hobart Australia (seascapemodels​.org). Two-day in-person R and AI workshop in Hobart (Nov 11–12, 2025) covering AI-assisted coding, GLMs in R, with Chris Brown and Anthony Richardson

R Weekly 2025-W39 Parse RMarkdown, Vibe Code, Quality Control (rweekly​.org). Weekly news in the R Community: Parse RMarkdown/Quarto, Vibe Code, quality control, and package updates

posit::conf(2025) - Atlanta, USA (ropensci​.org). ROpenSci posit::conf(2025) in Atlanta features lightning talks and sessions on R ecosystem, Positron from RStudio, targets, LLM productivity, and community building

Positron at posit::conf(2025) Roundup (posit​.co). Positron at posit::conf(2025) Roundup highlights Positron launch, Python/R integration, centralized management, package repositories, and Shiny apps via Posit platform


🛠️ Data engineering, cleaning, and reproducibility in R

Jumping in the Ducklake with nothing but R on (discindo​.org). Explores using DuckLake with R and DuckDB to implement medallion-style pipelines (bronze/silver/gold), writing data, lineage tracking, and visualization

Research note on the Radical Right Research Robot (kai-arzheimer​.com). Autonomous Radical Right Research Robot (RRRR) in R, posting references on radical right literature and shifting to Mastodon/Bluesky amid Twitter changes

All the Ways to Programmatically Edit or Parse R Markdown / Quarto Documents (ropensci​.org). Overview of R Markdown/Quarto parsing and editing in R, highlighting tinkr, md4r, Pandoc, parseqmd, parsermd, lightparser, string utilities, and frontmatter handling

Easily clean up messy databases with fuzzy matching in R (storybench​.org). Fuzzy matching in R with stringdist to clean messy session names on an IRE 2025 schedule

Bug affecting neonUtilities in latest RStudio version on Windows (neonscience​.org). Windows users of neonUtilities advised not to update RStudio 2025.09 due to downloader package conflict


🎨 Visualization, ggplot2, and publication-ready tables in R

Aesthetics Evaluation Control in ggplot (jmsallan​.netlify​.app). A practical guide to controlling aesthetic evaluation in ggplot2 using after_stat and related stats

How many cars are there in Madison? (haraldkliems​.netlify​.app). Car ownership in Madison rises with households; ACS data visualized in R

Recreating APA Manual Table 7.12 in R with apa7 (wjschne​.github​.io). Recreating APA Table 7.12 in R using apa7, flextable, ftExtra, and tidyverse with simulated data and aov analyses

Recreating APA Manual Table 7.7 in R with apa7 (wjschne​.github​.io). Recreating APA Table 7.7 in R using apa7, flextable, ftExtra, tidyverse, and chi-square calculations


🗺️ Spatial data, mapping, and geospatial workflows in R

Artpack 0.2.0 (thetidytrekker​.com). Geospatial pointinpolygon with sf, group tools, seqbounce, resizer, setbrightness, set_saturation, and ggplot2 visualizations for artpack 0.2.0

Phoenician colonization (r​.iresmi​.net). Reconstructs Phoenician colonization data via CSVs, tidyverse in R, parzer, sf, and leaflet

Drop #713 (2025-09-19): AVAST ME HEARTIES! (dailydrop​.hrbrmstr​.dev). Using R and DuckDB to access and process Maritime Safety Information and piracy data from ASAM/GeoJSON sources

Connecting the dots with R (aliceinstatisticsland​.wordpress​.com). Live stream of Ihaka lecture on spatial statistics, spatstat, ppm(), and spatial modelling with R


🧠 Modeling packages, explainability, and applied analyses in R

Frequently Asked Questions (metafor-project​.org). Overview of metafor package, validation, funding, citing, comparisons with other software, and technical details on I2/H2, R2, prediction intervals, transformations, and Mantel-Haenszel results

Replication Forensics: A Learning Experience for Students (svmiller​.com). Replication forensics in R: exploring Benoit 1996 data, WEEDE.ASC handling, and converting to modern formats with dplyr, readtable, and readfile

Key improvements in shapviz and kernelshap (lorentzen​.ch). Shapviz and kernelshap updates with GLM and XGBoost SHAP explanations and interactions

Chess Dreams and Breakthroughs: A Global Perspective (stevenponce​.netlify​.app). Global chess ratings analysis reveals patterns in activity, breakthroughs, and federations with FIDE data


📈 Statistical inference, probability, and causal analysis with R

T test in R (codingthepast​.com). T test in R: perform t.test, bootstrap approach, Titanic data, and bootstrap-based p-values in R

Some notes on probability judgement (blog​.djnavarro​.net). Calibrations of human probability judgments using YouGov data; explains 21% trans figure via Tversky–Kahneman heuristics and simple error models in R

Causal Inference in R (lucymcgowan​.com). Causal diagrams, propensity scores, and inverse probability weighting in R for causal questions using tidyverse tools

Generating Synthetic Data with R-vine Copulas using esgtoolkit in R (thierrymoudiki​.github​.io). Tutorial on generating synthetic data with R-vine copulas using esgtoolkit in R and RVineModel fitting

Week 3, days 5 and 6, plus a story about whale watching (causalinf​.substack​.com). Harvard lecturer blogs about classroom experiences, causal inference lessons, local Boston trip, motion sickness on a boat, and personal growth

Analysis of Sales Shift in Retail with Causal Impact: A Case Study at Carrefour (towardsdatascience​.com). Causal Impact analyzes sales shift after product unavailability using Bayesian structural time-series with covariates and a synthetic control in Carrefour case study

Estimating rare proportions (statschat​.org​.nz). Estimating rare proportions and biases in small vs large proportions in surveys, with references to Navarro and Gelman


📚 Academic Research

Efficient and Accessible Discrete Choice Experiments: The DCEtool Package for R (arxiv:econ). DCEtool: an R package with a Shiny interface for efficient, accessible discrete choice design, decoding, and analysis

hi

Don't miss what's next. Subscribe to The R Data Scientist:
Start the conversation:
Bluesky Mastodon LinkedIn
Powered by Buttondown, the easiest way to start and grow your newsletter.