Data Scientist (with R)
R news, data engineering, visualisation, spatial data
📰 R community news, events, and Posit updates
Weekly recap (Sep 19, 2025) (blog.stephenturner.us). AI-generated genomes, biosafety, mirror life, AI in teaching, CRAN defense, Posit news, and OpenAI ChatGPT usage insights
Generative AI and R workshops in Hobart Australia (seascapemodels.org). Two-day in-person R and AI workshop in Hobart (Nov 11–12, 2025) covering AI-assisted coding, GLMs in R, with Chris Brown and Anthony Richardson
R Weekly 2025-W39 Parse RMarkdown, Vibe Code, Quality Control (rweekly.org). Weekly news in the R Community: Parse RMarkdown/Quarto, Vibe Code, quality control, and package updates
posit::conf(2025) - Atlanta, USA (ropensci.org). ROpenSci posit::conf(2025) in Atlanta features lightning talks and sessions on R ecosystem, Positron from RStudio, targets, LLM productivity, and community building
Positron at posit::conf(2025) Roundup (posit.co). Positron at posit::conf(2025) Roundup highlights Positron launch, Python/R integration, centralized management, package repositories, and Shiny apps via Posit platform
🛠️ Data engineering, cleaning, and reproducibility in R
Jumping in the Ducklake with nothing but R on (discindo.org). Explores using DuckLake with R and DuckDB to implement medallion-style pipelines (bronze/silver/gold), writing data, lineage tracking, and visualization
Research note on the Radical Right Research Robot (kai-arzheimer.com). Autonomous Radical Right Research Robot (RRRR) in R, posting references on radical right literature and shifting to Mastodon/Bluesky amid Twitter changes
All the Ways to Programmatically Edit or Parse R Markdown / Quarto Documents (ropensci.org). Overview of R Markdown/Quarto parsing and editing in R, highlighting tinkr, md4r, Pandoc, parseqmd, parsermd, lightparser, string utilities, and frontmatter handling
Easily clean up messy databases with fuzzy matching in R (storybench.org). Fuzzy matching in R with stringdist to clean messy session names on an IRE 2025 schedule
Bug affecting neonUtilities in latest RStudio version on Windows (neonscience.org). Windows users of neonUtilities advised not to update RStudio 2025.09 due to downloader package conflict
🎨 Visualization, ggplot2, and publication-ready tables in R
Aesthetics Evaluation Control in ggplot (jmsallan.netlify.app). A practical guide to controlling aesthetic evaluation in ggplot2 using after_stat and related stats
How many cars are there in Madison? (haraldkliems.netlify.app). Car ownership in Madison rises with households; ACS data visualized in R
Recreating APA Manual Table 7.12 in R with apa7 (wjschne.github.io). Recreating APA Table 7.12 in R using apa7, flextable, ftExtra, and tidyverse with simulated data and aov analyses
Recreating APA Manual Table 7.7 in R with apa7 (wjschne.github.io). Recreating APA Table 7.7 in R using apa7, flextable, ftExtra, tidyverse, and chi-square calculations
🗺️ Spatial data, mapping, and geospatial workflows in R
Artpack 0.2.0 (thetidytrekker.com). Geospatial pointinpolygon with sf, group tools, seqbounce, resizer, setbrightness, set_saturation, and ggplot2 visualizations for artpack 0.2.0
Phoenician colonization (r.iresmi.net). Reconstructs Phoenician colonization data via CSVs, tidyverse in R, parzer, sf, and leaflet
Drop #713 (2025-09-19): AVAST ME HEARTIES! (dailydrop.hrbrmstr.dev). Using R and DuckDB to access and process Maritime Safety Information and piracy data from ASAM/GeoJSON sources
Connecting the dots with R (aliceinstatisticsland.wordpress.com). Live stream of Ihaka lecture on spatial statistics, spatstat, ppm(), and spatial modelling with R
🧠 Modeling packages, explainability, and applied analyses in R
Frequently Asked Questions (metafor-project.org). Overview of metafor package, validation, funding, citing, comparisons with other software, and technical details on I2/H2, R2, prediction intervals, transformations, and Mantel-Haenszel results
Replication Forensics: A Learning Experience for Students (svmiller.com). Replication forensics in R: exploring Benoit 1996 data, WEEDE.ASC handling, and converting to modern formats with dplyr, readtable, and readfile
Key improvements in shapviz and kernelshap (lorentzen.ch). Shapviz and kernelshap updates with GLM and XGBoost SHAP explanations and interactions
Chess Dreams and Breakthroughs: A Global Perspective (stevenponce.netlify.app). Global chess ratings analysis reveals patterns in activity, breakthroughs, and federations with FIDE data
📈 Statistical inference, probability, and causal analysis with R
T test in R (codingthepast.com). T test in R: perform t.test, bootstrap approach, Titanic data, and bootstrap-based p-values in R
Some notes on probability judgement (blog.djnavarro.net). Calibrations of human probability judgments using YouGov data; explains 21% trans figure via Tversky–Kahneman heuristics and simple error models in R
Causal Inference in R (lucymcgowan.com). Causal diagrams, propensity scores, and inverse probability weighting in R for causal questions using tidyverse tools
Generating Synthetic Data with R-vine Copulas using esgtoolkit in R (thierrymoudiki.github.io). Tutorial on generating synthetic data with R-vine copulas using esgtoolkit in R and RVineModel fitting
Week 3, days 5 and 6, plus a story about whale watching (causalinf.substack.com). Harvard lecturer blogs about classroom experiences, causal inference lessons, local Boston trip, motion sickness on a boat, and personal growth
Analysis of Sales Shift in Retail with Causal Impact: A Case Study at Carrefour (towardsdatascience.com). Causal Impact analyzes sales shift after product unavailability using Bayesian structural time-series with covariates and a synthetic control in Carrefour case study
Estimating rare proportions (statschat.org.nz). Estimating rare proportions and biases in small vs large proportions in surveys, with references to Navarro and Gelman
📚 Academic Research
Efficient and Accessible Discrete Choice Experiments: The DCEtool Package for R (arxiv:econ). DCEtool: an R package with a Shiny interface for efficient, accessible discrete choice design, decoding, and analysis
hi