#datascience
Our yearly #SciComp / # HPC Kickstart course starts tomorrow. It's #livestream on EU-afternoons, so the whole world is invited! (6-8 June 2023, starting 10:50 CEST, 4hr/day) #RSEng #DataScience #Computing
Day 1: general #SciComp basics + connecting, Days 2-3: #HPC usage
BenchSci is hiring a Senior Data Engineer in #Toronto! Apply today at https://www.datasciencejobscanada.com/jobs/senior-data-engineer-benchsci-7d3e5. #datascience #datasciencejobs #MLjobs

Do you burn more calories running than walking?
If your exercise is time-constrained, you'll (obviously) burn more calories the faster you go whether you run or walk. If your exercise is distance-constrained, you'll burn slightly more calories running than walking.
https://blog.kamens.us/2023/06/04/do-you-burn-more-calories-running-than-walking/
New Newsletter 📥 Get the inside scoop on GPT-4 & PaLM 2. Unpack the intricacies of these foundation models and understand the evolution of #LLMs
#NLproc #MachineLearning #DataScience
https://gradientflow.substack.com/p/what-you-need-to-know-about-gpt-4
I'm privileged to announce that I've just published the 3rd part of my blog on 'Computational Theories of Cognition: Models of Causal Inference' on @medium. I did my best cover as much as possible. Comments are highly welcome!
#cognitivescience #linguistics #computation #knowledgegraphs #reasoning #datascience #ai #blog #medium

Clari is hiring a Data Engineer II in #Seattle! Apply today at https://www.datasciencejobsusa.com/jobs/data-engineer-ii-clari-8c0e9. #datasciencejobs #datascientistjobs #datascience #aijobs #mljobs #machinelearning #engineeringjobs #Hiring #HiringAlert #USA #ML #AI
Great Australian Pods Podcast Directory: https://www.greataustralianpods.com/2023/06/making-data-better/ #GreatAusPods #AusPods #Podcast #Australia #Technology #DataScience #Tech #Cyber

AI Data Scientist
Lunenfeld-Tanenbaum Research Institute
See the full job description on jobRxiv: https://jobrxiv.org/job/lunenfeld-tanenbaum-research-institute-27778-ai-data-scientist/?feed_id=47138
#ScienceJobs #hiring #research #AI, #Datascience, #Statistics #Epidemiology #MachineLearning #MedicalImaging #Cancer #radiomics
Toronto #Canada #Researcher #Scientist
https://jobrxiv.org/job/lunenfeld-tanenbaum-research-institute-27778-ai-data-scientist/?feed_id=47138
The Chair of Humanities Data Science and Methodology works at the intersection of historical research, oral history, digital cultural heritage and the digital humanities.
Follow here: https://mastodon.social/@HDSM
BenchSci is hiring a Senior Data Engineer in #Vancouver! Apply today at https://www.datasciencejobscanada.com/jobs/senior-data-engineer-benchsci-2d854. #datascience #datasciencejobs #MLjobs

Do you burn more calories if you walk faster?
If your walk is time-constrained, walking faster burns more calories. If your walk is distance-constrained, walking faster makes no difference.
https://blog.kamens.us/2023/06/03/do-you-burn-more-calories-if-you-walk-faster/
New Newsletter 📥 Get the inside scoop on GPT-4 & PaLM 2. Unpack the intricacies of these foundation models and understand the evolution of #LLMs
#NLproc #MachineLearning #DataScience
https://gradientflow.substack.com/p/what-you-need-to-know-about-gpt-4
The #Linux Systems to get you going!
These just-works Linux systems featuring Kubuntu 22.04 LTS and the beautiful and intuitive #KDE desktop are meant to help you with any task and make it fun too!
See more at: https://kfocus.org/
#Code #Data #Developers #ML #Engineers #Laptop #Desktop #AI #Creators #DataScience #Graphics

I'm looking forward to speak about @ApacheGroovy
and Apache Ignite at the Ignite Summit next week. Clustering Whiskey profiles in a cluster! Machine learning at scale!
Find out more details here:
https://ignite-summit.org
#groovylang #datascience #machinelearning

Clari is hiring a Data Engineer II in #Remote! Apply today at https://www.datasciencejobsusa.com/jobs/data-engineer-ii-clari-d3820. #datasciencejobs #datascientistjobs #datascience #aijobs #mljobs #machinelearning #engineeringjobs #Hiring #HiringAlert #USA #ML #AI
BenchSci is hiring a Senior Machine Learning Engineer in #Toronto! Apply today at https://www.datasciencejobscanada.com/jobs/senior-machine-learning-engineer-benchsci-6e2e4. #datascience #datasciencejobs #MLjobs

Just 2 weeks left to grab a ticket to our Health Hack weekend with NHS Grampian and University of Aberdeen.
Loads of challenges to work on in small teams. All skills needed. https://ti.to/code-the-city/ctc29 #health #datascience #ai #innovation

📝 "MLOps made simple: how to run a batch prediction pipeline using Azure Machine Learning components"
👤 Déborah Mesquita
#pyladies #python #datascience #mlops #azuremachinelearning #azure
This Thursday, join us for a workshop on visualizing data with #python, #streamlit, and #plotly. We'll have facilitators from the University of #Wisconsin #DataScience Hub lead us in a participatory workshop, building and customizing a demo app deployed onto the cloud. Bring your laptop!
https://www.meetup.com/madison-python/events/293265891/
Meeting at the #Madison library central branch. (No pizza this time. Sorry!)
Saberseminar scholarship applications are due MONDAY June 5th! #baseball #SportsAnalytics #DataScience
Scholarships | Sabermetrics Scouting and the Science of Baseball http://www.saberseminar.com/scholarships/
Greetings from #Scotland! 🏴
Professional developer, #DataScience newbie. Considering changing careers. Hope to go through a #DataAnalysis bootcamp later in the year. Will use this account for recording learning. Love to find #Scottish datasets
#Introduction
The ChatGPT plugin of Noteable is magic. It took me seconds to build an optimized and reliable machine learning model with #rstats and #tidymodels. "A notitia physicus lupus alii notitia physicus". I have spoken.
https://app.noteable.io/f/66fe418e-aceb-419d-b1f8-17eb2eb1be04/ChatGPT-Plugin-for-Jupyter-Notebooks.ipynb #datascience
BenchSci is hiring a Senior Machine Learning Engineer in #Toronto! Apply today at https://www.datasciencejobscanada.com/jobs/senior-machine-learning-engineer-benchsci-d24e5. #datascience #datasciencejobs #MLjobs

Join us Thursday, June 8, at noon CT/1 pm ET for a special webinar, “New BIG data resources on St. Jude Cloud.” Panelists from St. Jude will discuss and perform a live demo of the Pediatric Cancer Knowledgebase (PeCanV2) and Survivorship Portal. Register today. https://bit.ly/SOCC23-MAS
#SOCC23 #ChildhoodCancer #BigData #DataScience

Challenge your assumptions about numbers and understanding 🤔🧐
with Climateer's thought-provoking article.
Explore the limitations of relying on numbers as shortcuts to understanding complex systems, and learn how to think critically and holistically about the issues facing our world today.
My newsletter subscribers learned about this 9 months ago!
https://dramsch.net/newsletter
Check it out here:
https://climateer.substack.com/p/numbers

New Newsletter 📥 Get the inside scoop on GPT-4 & PaLM 2. Unpack the intricacies of these foundation models and understand the evolution of #LLMs
#NLproc #MachineLearning #DataScience
https://gradientflow.substack.com/p/what-you-need-to-know-about-gpt-4
Clari is hiring a Principal Data Engineer in #Seattle! Apply today at https://www.datasciencejobsusa.com/jobs/principal-data-engineer-clari-bdf9b. #datasciencejobs #datascientistjobs #datascience #aijobs #mljobs #machinelearning #engineeringjobs #Hiring #HiringAlert #USA #ML #AI
✅ Sign up, Q&A 🎤, international 🌐: What skills are needed for #DataScience #AI to fully deliver in digital humanities, linguistics, museums, libraries, collections, natural history and more? Tues 6 June 18:00 BST host Alan Turing Institute and @LivingWithMachines https://livingwithmachines.ac.uk/event/ai-beyond-stem-digital-skills-to-unleash-the-power-of-data-science-and-ai-for-all/

I've been building and playing with my own #neuralnetworks in #python recently. My favourite has got to be the self-organising map #SOM.
https://jrashford.com/2023/06/02/how-self-organising-maps-work-explained-with-graphics/

Find jobs fast at https://www.datasciencejobscanada.com ! #datascience
New Newsletter 📥 Get the inside scoop on GPT-4 & PaLM 2. Unpack the intricacies of these foundation models and understand the evolution of #LLMs
#NLproc #MachineLearning #DataScience
https://gradientflow.substack.com/p/what-you-need-to-know-about-gpt-4
Episode 124 of the @rstats @rweekly Highlights Podcast is out now! https://podverse.fm/episode/yWDOzHEVh
🧩 Curried functions in R with {purrr} partials (Michael Decrescenzo)
📦 {fusen} simplifying writing packages @statnmap @RConsortium
🗺️ Parliament proportions @peter_ellis
Happy with your current podcast app but want to send a boost? You can do that directly on the Podcast Index! Find us at https://podcastindex.org/podcast/1062040
🚀Join our immersive 3-day course and master R, #tidyverse, and #ggplot2 to create stunning scientific plots. Enhance your skills, communicate impactfully, and let your data tell a captivating story. Don't miss out: https://physalia-courses.org/courses-workshops/ggplot23/
Early appearance on the #bbcbreakfast sofa this morning with Charlie and Nina talking about #UnidentifiedAnomalousPhenomena (#uap aka #ufo). A #nasa Independent Study Team had its first public meeting yesterday. It's not investigating UAP reports, but is recommending that the public report sightings without stigma, that #nasa and other bodies bring together relevant datasets , and that new #datascience approaches such as #machinelearning be used. Recommendations will be published next month.


Kedro 0.18.9 is out! 🔶
We added support for a metadata attribute in datasets, introduced a new `kedro.logging.RichHandler` that is more flexible and configurable, fixed some bugs with `OmegaConfigLoader`, and made substantial improvements to our deployment docs.
Install it now with pip or conda:
```
pip install kedro==0.18.9
conda install -c conda-forge kedro==0.18.9
```
And read the complete release notes online: https://github.com/kedro-org/kedro/releases/tag/0.18.9
👉 Data Science Best Practices 🥰
@rladiesrome is hosting a new event on June 12 at 6pm CEST/ 12pm EDT
🗣️@siminaboca@bird.makeup between the Celebrating Women in Statistics 2021 (🔗bit.ly/3IRji6T) will be our speaker
When you are starting a new #datascience project but you have to use windows and aren't allowed docker. And it crashes (see windows) and you forget to switch back to your environment to install the rest of the packages and mess up #anaconda base:
(Alt text here otherwise it deletes gif- Elen Rippley from alien: I say we take off and nuke it from orbit... It's the only way to be sure)

There are now less than three weeks until our Health Hack weekend where we will be working with NHS Grampian and University of Aberdeen on some real world challenges.
With 30+ attendees already signed up, this promises to be one of our most impactful and enjoyable sessions for some time.
More details of the event, the challenges, and how to get a ticket: https://codethecity.org/ctc29/
#health, #datascience #civtech #techforgood #innovation #collaboration

How we collect information matters as does how we analyze, share, and build upon it.
When people voluntarily give us data, they are giving us their trust.
#Data #DataScience #Ethics #SocialJustice #Equality
https://www.juliaferraioli.com/blog/2023/influential-articles-may/
kedro-datasets 1.4.0 is out! 🔶 With a new SparkStreamingDataSet!
kedro-datasets is a separate PyPI package where Kedro datasets live. ⚠️ Notice that `kedro.extras.datasets` is deprecated and will be removed in Kedro 0.19, so install the new package now!
```
pip install "kedro-datasets==1.4.0"
```
T-Mobile is using #AI to fight #churn - this article also has #churnrate stats for #mobilephone : T-Mobile .89%, Verizon .84%, AT&T .81%. #machinelearning #datascience #analytics
https://www.phonearena.com/news/t-mobile-ai-will-reduce-churn_id147823
In unserem Format "Drei Fragen an..." stellen wir diese diesmal an Dr. @oliver_karras, Post-Doc und #DataScientist in der #TIB-Forschungsgruppe #DataScience and #DigitalLibraries.
Das komplette Interview über #KünstlicheIntelligenz (#KI, #AI), #ChatGPT, #Wissensgraph|en und die Beantwortung wissenschaftlicher Fragen findet ihr hier: https://blogs.tib.eu/wp/tib/2023/05/26/drei-fragen-an-dr-oliver-karras/
📢 Don't miss our #DataScience talk by Andreas Baumann from @DH_UniWien about the origins of #SemanticDiversity in language and #LanguageEvolution on 📅 Mon 12 June @ 14:00 CEST on-site @univienna or online #Zoom
https://datascience.univie.ac.at/dsunivie-talks/about/news/andreas-baumann-thats-the-key-modeling-the-effects-of-non-conformist-behavior-and-perceptual-gr/

Check out Cassie Kozyrkov's "Stats Gist List," 📊📚
an irreverent statistician's guide to jargon
From "p-value" to "overfitting," gain a deeper understanding of key statistical concepts in an engaging and accessible way.
My newsletter subscribers learned about this 13 months ago!
https://dramsch.net/newsletter
Check it out here:
https://towardsdatascience.com/stats-gist-list-an-irreverent-statisticians-guide-to-jargon-be8173df090d

Tutorial for selecting the number and labels of topics in topic modeling: https://journals.sagepub.com/doi/full/10.1177/25152459231160105
#researchmethods #research #topicmodeling #datascience #psych #newpsychresearch #socpsych #personaliy #methods



Can you answer this week's interview question? 🤔
What is the F1-score and when would you choose it over other metrics?
Leave your answer below! 👇
Get the answer next week here: https//dramsch.net/newsletter

We have two open professorships in #MachineLearning and #DataScience | #DataAnalytics at the University of Graz in the newly founded interdisciplinary Idea_Lab
Great colleagues & quality of life
This was a fascinating, illuminating, and enraging conversation with Dr Richard Denniss, Executive Director of The Australia Institute, and Professor Margaret Hellard, Deputy Director of the Burnet Institute, about why we're not collecting covid data, and why it matters. This one is really important, so please listen, and share.
RD:"Why aren't we collecting the data? Because they don't want to admit failure. They don't want to make it easy for me to tell you what the cost to GDP of this heroic approach to covid has been."
#data #covid #STEM #PublicHealth #DataScience #epidemiology #auspol
https://adsei.org/podcast/covid-data-why-arent-we-collecting-it-anymore/
Can you answer this week's interview question? 🤔
What is the F1-score and when would you choose it over other metrics?
Leave your answer below! 👇
Get the answer next week here: https//dramsch.net/newsletter
Datasci.social is a server for researchers & practitioners in human-centric data science, broadly defined, like network science, computational social science, geospatial data science:
:Fediverse: https://datasci.social
For more info see their About page at https://datasci.social/about or ask their admin @mszll
#FeaturedServer #DataScience #DataSci #NetworkScience #Networks #ComputationalSocialScience #SocialScience #DataViz #Geospatial #GeospatialDataScience #Data #Science #Academic #Fediverse
#WHO #PublicHealth #OpenData #BigData #DataScience
'Starting with the data underlying WHO's annual World Health Statistics report, the new website reimagines the indicator page – the most representative level of data presentation – with consistent, expressive and accessible visualization, while also presenting metadata to promote ease of accessibility, reference and use.'
https://www.who.int/news-room/feature-stories/detail/who-releases-data.who.int
Episode 123 of the @rstats @rweekly Highlights Podcast blends nicely with the R community! https://podverse.fm/episode/DmcMBmplP
🕸 HTTP Testing with R @maelle @RConsortium
💹 Introducing {ggblend} @mjskay
📆 Handling dates in R & Excel @AbrahamsAmieroh@twitter.com @jumpingrivers
Happy with your current podcast app but want to send a boost? You can do that directly on the Podcast Index! Find us at https://podcastindex.org/podcast/1062040
h/t @mike_thomas @batool664@twitter.com 🙏
A whole new way to manipulate images!
Just "Drag your GAN"! 😍
This groundbreaking research paper introduces an interactive, point-based manipulation technique for GAN-generated images that's sure to blow your mind! 🚀
This looks so fun!
Join my newsletter, where I share these projects every week! https://dramsch.net/newsletter
🧠: https://vcai.mpi-inf.mpg.de/projects/DragGAN/
📝: https://arxiv.org/abs/2305.10973
💻: https://github.com/XingangPan/DragGAN (Coming Soon)
#machinelearning #datascience #python #deeplearning #career #tech
I have a figure made with the R package 'maps'.
The paper with that figure got accepted by a journal requiring a CCBy4 license.
What's the license of figures that we produce by using that R package? The code of the package is released under GPL3, but not clear to me if that applies also to the figures produced by me with that package.
#RStat #datascience #license #copyright
Ideas? @franco_vazza @tiago
My line manager is dead keen for me to use #Jupyter notebooks convinced they will solve any problem. I've used them before, I don't like them! #Python #DataScience
My main complaint is that execution order/state is not clear and Notebooks aren't particularly git friendly.
kedro-datasets 1.3.0 is out! 🔶
kedro-datasets is a separate PyPI package where Kedro datasets live. ⚠️ Notice that `kedro.extras.datasets` is deprecated and will be removed in Kedro 0.19, so install the new package now!
```
pip install "kedro-datasets==1.3.0"
```
Highlights: pandas 2.0 and SQLAlchemy 2.0 support, new `metadata` attribute, new ManagedTableDataSet for managed delta tables on Databricks, Polars 0.17 support, and more!
Video of my talk at CITP (Princeton):
'Digital Discrimination and the Law in Europe'
https://citp.princeton.edu/event/citp-seminar-zuiderveen-borgesius/
#AI #FAccT #bias #discrimination #tech #gdpr #privacy #data #dataprotection #datascience #machinelearning #aiact #law #politics
We don't have a Mastodon account (yet) but I think it's worth sharing that Open Data Scotland is hosting its inaugural hack weekend this coming Friday.
Come along if you want to find out more about our community and get involved!
📅 When: Friday 26th May, 18:00 - 22:00 (you don't have to stay for the whole time!)
🔗 Sign up now: https://opencollective.com/opendata_scot/events/hack-evening-0ca24921
#OpenData #Hackathon #Open #Data #Scotland #Event #DataScience
Wow - what a paper on the path(s) from data to insights!
"Variability in research outcomes between researchers can occur even under rigid adherence to the scientific method, high ethical standards, and state-of-the-art approaches to maximizing reproducibility."
https://www.pnas.org/doi/10.1073/pnas.2203150119
#DataScience #Science #reproducibility
This week https://datasci.social has turned 6 months old 🥳
We are 133 people now (68 active), with increased growth in the past few weeks. Our server hosts a community for #DataScience, broadly defined. See below our place in the universe of servers.
Our operation is financed via donations, we are grateful for support: https://community.datasci.social/docs/support/
Are you in Amsterdam and want to learn how to effectively create and run maintainable data pipelines in the cloud? 🔶
Join Xebia | Data for an in-person Kedro Code Breakfast at their offices next Tuesday, May 23rd 📅 Seats are limited!
https://events.xebia.com/code-breakfast-kedro/?utm_source=mastodon
Just four weeks unti our Health Hackathon on 17-18 June! Our number of attendees signed up has more than doubled in the last week. And even more challenges are being added each week.
https://codethecity.org/ctc29/
Thanks to NHS Grampian and University of Aberdeen for sponsoring the event and to Robert Gordon University for participating too. Also thanks to ONE Tech Hub for hosting us! See you there!
#health #hackathon #coding #datascience #AI #voicecontrol #LLM #appdevelopment
"Do not sit and be patient with a guy who just nuked 14 Earths to sell more cheeseburgers!"
Fast food and credit cards are neither economical nor sustainable; these are Zuck's biggest advertizers.
https://www.statista.com/statistics/1250606/facebook-advertisers/

I published my first academic paper thirty years ago.
Overnight, a researcher contacted me to ask if I had the data for the main figure in the paper so that they could reproduce it.
Could I help? The data was from the time before Windows, 5.25-inch floppy discs, and graphs sent from a Unix environment directly to a printer for redrawing for publication by a cartographer.
You bet! In my archive, there was the original input data and code (Fortran77). And now shared.
#academia #publishing #science #research #HigherEd #caves #climate #academicchatter #Fortran #datascience

New blog post: A Polars exploration into Kedro 🐻❄️
Ahead of @astrojuanlu's workshop on PyCon Lithuania this week, he described in this blog post what's the current status of Polars support in Kedro, how can you use it instead of pandas, and what can you expect in the future.
https://kedro.org/blog/a-polars-exploration-into-kedro
@ritchie46, creator of Polars, will be at the event as well talking about the future of the project.
Rescuing a series of blog posts I wrote at `{{ previous_job }}` for my @thepracticaldev blog. Starting with the first one:
"Demystifying Apache Arrow"
https://dev.to/astrojuanlu/demystifying-apache-arrow-5b0a
#DEVCommunity #python #arrow #datascience #dataframes #pydata
🐍 Python Pulse 🐍 is going live in 10 minutes with @BajoranEngineer and Jeffrey Mew. We're talking about how the new Data Wrangler extension is revolutionizaing the way you clean your #data into your code editor ▶️ https://youtube.com/live/5EeBSNr0x3Y @pythonvscode #DataScience #pythoncode https://t.co/hCHkw4C5Hh
:sys_twitter: https://twitter.com/code/status/1657080704285523968
I'm reading Masterminds of Programming - which includes a chapter about APL, so I thought I'd give it a go here: https://tryapl.org/
It is completely bonkers! Reminiscent of Haskell but with the added obfuscation of using its on set of weird special characters.
For example, ⌹ invokes the matrix inversion function
Help! Looking for a vaguely-remembered toot from the past week
It was a screenshot of a document from Google, on the subject of training ML/AI, discussing how smaller, high-quality datasets were better
Ring any bells?
plotnine 0.12.1 is out! 🎨
plotnine is an implementation of the Grammar of Graphics in Python. In other words: "ggplot2 meets Python".
This version has a new layout manager (so it's easier to avoid overlapping objects), a new `save_helper()` method gives you access to the matplotlib figure, and much more.
Install it with `pip install "plotnine==0.12.1"`
Complete release notes: https://github.com/has2k1/plotnine/releases/tag/v0.12.1
A practical handbook on data engineering with Rust:
pandera 0.15.0 is out!
pandera allows you to define schemas for your DataFrames, tighten them with rules, and validate your data to prevent errors.
The new version ships support for pandas 2.0, bare data dtypes for schemas, default values, and more.
Install it with `pip install "pandera==0.15.0"`
More information https://github.com/unionai-oss/pandera/releases/tag/v0.15.0
NEU
Die Slides, Notizen, Code und Vertiefungshinweise für meinen Vortrag "Legal Data Science: Der moderne Weg zur Wahrheit" bei der Digitalen Richterschaft sind jetzt online!
Inklusive #RStats Praxisteil zum Nacharbeiten!
Alle Downloads: https://doi.org/10.5281/zenodo.7877803
#Law #DigitalHumanities #Rechtsstaat #Wahrheit #Gerechtigkeit #DataScience #OpenScience #OpenData #OpenSource #OpenAccess #LawFedi @rstats @law @politicalscience
Listen to this interview to Yetunde Dada and Ivan Danov, Product Director and Engineering Director for Kedro 🎙️ Thanks to Adam Kawa from GetInData | Part of Xebia for recording it!
The Invisible Workload of Open Research :ablobcatknitsweats:
"It is argued that there is a high chance that without intervention, increased expectations to engage in open research practices may lead to unacceptable increases in demands on academics"
#OpenScience #openaccess #preprint #datascience #fair
#academicchatter
https://journal.trialanderror.org/pub/the-invisible-workload/release/1