Masthash

#datascience

Aalto Scientific Computing
42 minutes ago

Our yearly #SciComp / # HPC Kickstart course starts tomorrow. It's #livestream on EU-afternoons, so the whole world is invited! (6-8 June 2023, starting 10:50 CEST, 4hr/day) #RSEng #DataScience #Computing

Day 1: general #SciComp basics + connecting, Days 2-3: #HPC usage

https://scicomp.aalto.fi/training/scip/kickstart-2023/

Ellis Hughes
11 hours ago

How do you make a report from your #shiny app?

Learn how with Patrick Ward and I in epsiode 147 of #TidyX! We expand on epsiode 146, discussing techniques and use some fun {shinyjs} to enable and disable buttons!

#rstats #datascience #reports

Bit.ly/TidyX_Ep147

Jonathan Kamens
11 hours ago

Do you burn more calories running than walking?

If your exercise is time-constrained, you'll (obviously) burn more calories the faster you go whether you run or walk. If your exercise is distance-constrained, you'll burn slightly more calories running than walking.

https://blog.kamens.us/2023/06/04/do-you-burn-more-calories-running-than-walking/

#Ramblings #DataScience #exercise

Ben Lorica 罗瑞卡
17 hours ago

New Newsletter 📥 Get the inside scoop on GPT-4 & PaLM 2. Unpack the intricacies of these foundation models and understand the evolution of #LLMs
#NLproc #MachineLearning #DataScience
https://gradientflow.substack.com/p/what-you-need-to-know-about-gpt-4

Alireza Dehbozorgi
20 hours ago

I'm privileged to announce that I've just published the 3rd part of my blog on 'Computational Theories of Cognition: Models of Causal Inference' on @medium. I did my best cover as much as possible. Comments are highly welcome!

https://medium.com/@alirezadehbozorgi83/computational-models-of-cognition-modes-of-inductive-reasoning-d01567bf7f53

#cognitivescience #linguistics #computation #knowledgegraphs #reasoning #datascience #ai #blog #medium

https://medium.com/@alirezadehbozorgi83/computational-models-of-cognition-modes-of-inductive-reasoning-d01567bf7f53
Harald Klinke
1 day ago

The Chair of Humanities Data Science and Methodology works at the intersection of historical research, oral history, digital cultural heritage and the digital humanities.
Follow here: https://mastodon.social/@HDSM

#datascience #digitalhumanities

Data Science Jobs Canada
1 day ago
Jonathan Kamens
1 day ago

Do you burn more calories if you walk faster?

If your walk is time-constrained, walking faster burns more calories. If your walk is distance-constrained, walking faster makes no difference.

https://blog.kamens.us/2023/06/03/do-you-burn-more-calories-if-you-walk-faster/

#Ramblings #DataScience #exercise #Garmin

Ben Lorica 罗瑞卡
2 days ago

New Newsletter 📥 Get the inside scoop on GPT-4 & PaLM 2. Unpack the intricacies of these foundation models and understand the evolution of #LLMs
#NLproc #MachineLearning #DataScience
https://gradientflow.substack.com/p/what-you-need-to-know-about-gpt-4

Kubuntu Focus
2 days ago

The #Linux Systems to get you going!

These just-works Linux systems featuring Kubuntu 22.04 LTS and the beautiful and intuitive #KDE desktop are meant to help you with any task and make it fun too!
See more at: https://kfocus.org/

#Code #Data #Developers #ML #Engineers #Laptop #Desktop #AI #Creators #DataScience #Graphics

Paul King
2 days ago

I'm looking forward to speak about @ApacheGroovy
and Apache Ignite at the Ignite Summit next week. Clustering Whiskey profiles in a cluster! Machine learning at scale!
Find out more details here:
https://ignite-summit.org
#groovylang #datascience #machinelearning

Data Science Jobs Canada
2 days ago
Code The City
2 days ago

Just 2 weeks left to grab a ticket to our Health Hack weekend with NHS Grampian and University of Aberdeen.

Loads of challenges to work on in small teams. All skills needed. https://ti.to/code-the-city/ctc29 #health #datascience #ai #innovation

Madison Python
2 days ago

This Thursday, join us for a workshop on visualizing data with #python, #streamlit, and #plotly. We'll have facilitators from the University of #Wisconsin #DataScience Hub lead us in a participatory workshop, building and customizing a demo app deployed onto the cloud. Bring your laptop!

https://www.meetup.com/madison-python/events/293265891/

Meeting at the #Madison library central branch. (No pizza this time. Sorry!)

Stephanie
2 days ago

Saberseminar scholarship applications are due MONDAY June 5th! #baseball #SportsAnalytics #DataScience

Scholarships | Sabermetrics Scouting and the Science of Baseball http://www.saberseminar.com/scholarships/

Data Quine
3 days ago

Greetings from #Scotland! 🏴󠁧󠁢󠁳󠁣󠁴󠁿
Professional developer, #DataScience newbie. Considering changing careers. Hope to go through a #DataAnalysis bootcamp later in the year. Will use this account for recording learning. Love to find #Scottish datasets
#Introduction

Essi Parent
3 days ago

The ChatGPT plugin of Noteable is magic. It took me seconds to build an optimized and reliable machine learning model with #rstats and #tidymodels. "A notitia physicus lupus alii notitia physicus". I have spoken.
https://app.noteable.io/f/66fe418e-aceb-419d-b1f8-17eb2eb1be04/ChatGPT-Plugin-for-Jupyter-Notebooks.ipynb #datascience

Data Science Jobs Canada
3 days ago
stjuderesearch
3 days ago

Join us Thursday, June 8, at noon CT/1 pm ET for a special webinar, “New BIG data resources on St. Jude Cloud.” Panelists from St. Jude will discuss and perform a live demo of the Pediatric Cancer Knowledgebase (PeCanV2) and Survivorship Portal. Register today. https://bit.ly/SOCC23-MAS
#SOCC23 #ChildhoodCancer #BigData #DataScience

Challenge your assumptions about numbers and understanding 🤔🧐

with Climateer's thought-provoking article.

Explore the limitations of relying on numbers as shortcuts to understanding complex systems, and learn how to think critically and holistically about the issues facing our world today.

My newsletter subscribers learned about this 9 months ago!
https://dramsch.net/newsletter

Check it out here:
https://climateer.substack.com/p/numbers

#DataScience #LateToTheParty

Ben Lorica 罗瑞卡
3 days ago

New Newsletter 📥 Get the inside scoop on GPT-4 & PaLM 2. Unpack the intricacies of these foundation models and understand the evolution of #LLMs
#NLproc #MachineLearning #DataScience
https://gradientflow.substack.com/p/what-you-need-to-know-about-gpt-4

David Beavan
3 days ago

✅ Sign up, Q&A 🎤, international 🌐: What skills are needed for #DataScience #AI to fully deliver in digital humanities, linguistics, museums, libraries, collections, natural history and more? Tues 6 June 18:00 BST host Alan Turing Institute and @LivingWithMachines https://livingwithmachines.ac.uk/event/ai-beyond-stem-digital-skills-to-unleash-the-power-of-data-science-and-ai-for-all/

AI Beyond STEM sign-up screenshot
James Ashford
3 days ago

I've been building and playing with my own #neuralnetworks in #python recently. My favourite has got to be the self-organising map #SOM.

https://jrashford.com/2023/06/02/how-self-organising-maps-work-explained-with-graphics/

#datascience #dataviz

Data Science Jobs Canada
3 days ago
Ben Lorica 罗瑞卡
4 days ago

New Newsletter 📥 Get the inside scoop on GPT-4 & PaLM 2. Unpack the intricacies of these foundation models and understand the evolution of #LLMs
#NLproc #MachineLearning #DataScience
https://gradientflow.substack.com/p/what-you-need-to-know-about-gpt-4

R-Podcast (Eric) :pci:
4 days ago

Episode 124 of the @rstats @rweekly Highlights Podcast is out now! https://podverse.fm/episode/yWDOzHEVh

🧩 Curried functions in R with {purrr} partials (Michael Decrescenzo)
📦 {fusen} simplifying writing packages @statnmap @RConsortium
🗺️ Parliament proportions @peter_ellis

Happy with your current podcast app but want to send a boost? You can do that directly on the Podcast Index! Find us at https://podcastindex.org/podcast/1062040

h/t @mike_thomas @R_by_Ryo
#rstats #datascience #v4v

Physalia-courses
4 days ago

🚀Join our immersive 3-day course and master R, #tidyverse, and #ggplot2 to create stunning scientific plots. Enhance your skills, communicate impactfully, and let your data tell a captivating story. Don't miss out: https://physalia-courses.org/courses-workshops/ggplot23/

#Rstats #Datavisualization #DataScience

Eamonn Kerins
4 days ago

Early appearance on the #bbcbreakfast sofa this morning with Charlie and Nina talking about #UnidentifiedAnomalousPhenomena (#uap aka #ufo). A #nasa Independent Study Team had its first public meeting yesterday. It's not investigating UAP reports, but is recommending that the public report sightings without stigma, that #nasa and other bodies bring together relevant datasets , and that new #datascience approaches such as #machinelearning be used. Recommendations will be published next month.

On BBC Breakfast this morning with Charlie Stayt and Nina Warhurst.
On BBC Breakfast this morning with Charlie Stayt and Nina Warhurst.
Kedro
4 days ago

Kedro 0.18.9 is out! 🔶

We added support for a metadata attribute in datasets, introduced a new `kedro.logging.RichHandler` that is more flexible and configurable, fixed some bugs with `OmegaConfigLoader`, and made substantial improvements to our deployment docs.

Install it now with pip or conda:

```
pip install kedro==0.18.9
conda install -c conda-forge kedro==0.18.9
```

And read the complete release notes online: https://github.com/kedro-org/kedro/releases/tag/0.18.9

#kedro #python #pydata #datascience #machinelearning

Coding Gardener
4 days ago

Really enjoyed your presentation @JessButler

Brilliant work!

#WiDS2023 #MentalHealth #DataScience

R-Ladies Rome
4 days ago

👉 Data Science Best Practices 🥰
@rladiesrome is hosting a new event on June 12 at 6pm CEST/ 12pm EDT

🗣️@siminaboca@bird.makeup between the Celebrating Women in Statistics 2021 (🔗bit.ly/3IRji6T) will be our speaker

RSVP: https://www.meetup.com/rladies-rome/events/293609269/

#rstats #rladies #DataScience

Tim Newman
4 days ago

When you are starting a new #datascience project but you have to use windows and aren't allowed docker. And it crashes (see windows) and you forget to switch back to your environment to install the rest of the packages and mess up #anaconda base:

(Alt text here otherwise it deletes gif- Elen Rippley from alien: I say we take off and nuke it from orbit... It's the only way to be sure)

Code The City
4 days ago

There are now less than three weeks until our Health Hack weekend where we will be working with NHS Grampian and University of Aberdeen on some real world challenges.

With 30+ attendees already signed up, this promises to be one of our most impactful and enjoyable sessions for some time. 

More details of the event, the challenges, and how to get a ticket: https://codethecity.org/ctc29/

#health, #datascience #civtech #techforgood #innovation #collaboration

julia ferraioli :cc_by:
4 days ago

How we collect information matters as does how we analyze, share, and build upon it.

When people voluntarily give us data, they are giving us their trust.

#Data #DataScience #Ethics #SocialJustice #Equality

https://www.juliaferraioli.com/blog/2023/influential-articles-may/

Kedro
5 days ago

kedro-datasets 1.4.0 is out! 🔶 With a new SparkStreamingDataSet!

kedro-datasets is a separate PyPI package where Kedro datasets live. ⚠️ Notice that `kedro.extras.datasets` is deprecated and will be removed in Kedro 0.19, so install the new package now!

```
pip install "kedro-datasets==1.4.0"
```

#kedro #datascience #python #pydata #spark #pyspark

Carl Gold, PhD
5 days ago

T-Mobile is using #AI to fight #churn - this article also has #churnrate stats for #mobilephone : T-Mobile .89%, Verizon .84%, AT&T .81%. #machinelearning #datascience #analytics

https://www.phonearena.com/news/t-mobile-ai-will-reduce-churn_id147823

TIB
5 days ago

In unserem Format "Drei Fragen an..." stellen wir diese diesmal an Dr. @oliver_karras, Post-Doc und #DataScientist in der #TIB-Forschungsgruppe #DataScience and #DigitalLibraries.
Das komplette Interview über #KünstlicheIntelligenz (#KI, #AI), #ChatGPT, #Wissensgraph|en und die Beantwortung wissenschaftlicher Fragen findet ihr hier: https://blogs.tib.eu/wp/tib/2023/05/26/drei-fragen-an-dr-oliver-karras/

Alex Nedelcu ☕️
5 days ago

Course on Data Science

Shared #link (#DataScience).

https://e2eml.school/blog.html

📢 Don't miss our #DataScience talk by Andreas Baumann from @DH_UniWien about the origins of #SemanticDiversity in language and #LanguageEvolution on 📅 Mon 12 June @ 14:00 CEST on-site @univienna or online #Zoom
https://datascience.univie.ac.at/dsunivie-talks/about/news/andreas-baumann-thats-the-key-modeling-the-effects-of-non-conformist-behavior-and-perceptual-gr/

#DSHQ

Flyer with talk invitation. Content
Logo of University of Vienna
organisation research network Data Science @ Uni Vienna
dsUniVie Talk
Andreas Baumann, Department of European and Comparative Literature and Language Studies
That's the key! – Modeling the effects of non-conformist behavior and perceptual granularity on semantic diversification
Monday, 12 June 2023 @ 14:00–15:00 CEST
hybrid event:
on-site @ Seminarraum 5, Kolingasse 14, 1090 Vienna
online @ Zoom
link: datascience.univie.ac.at

Check out Cassie Kozyrkov's "Stats Gist List," 📊📚

an irreverent statistician's guide to jargon

From "p-value" to "overfitting," gain a deeper understanding of key statistical concepts in an engaging and accessible way.

My newsletter subscribers learned about this 13 months ago!
https://dramsch.net/newsletter

Check it out here:
https://towardsdatascience.com/stats-gist-list-an-irreverent-statisticians-guide-to-jargon-be8173df090d

#DataScience #LateToTheParty

Can you answer this week's interview question? 🤔

What is the F1-score and when would you choose it over other metrics?

Leave your answer below! 👇

Get the answer next week here: https//dramsch.net/newsletter

#DataScience #MachineLearning #Career #LateToTheParty

Juliane Jarke
1 week ago

We have two open professorships in #MachineLearning and #DataScience | #DataAnalytics at the University of Graz in the newly founded interdisciplinary Idea_Lab

Great colleagues & quality of life

https://jobs.uni-graz.at/prof/job.php?lang=en&job=7547

https://jobs.uni-graz.at/prof/job.php?lang=en&job=7549

Linda McIver
1 week ago

This was a fascinating, illuminating, and enraging conversation with Dr Richard Denniss, Executive Director of The Australia Institute, and Professor Margaret Hellard, Deputy Director of the Burnet Institute, about why we're not collecting covid data, and why it matters. This one is really important, so please listen, and share.

RD:"Why aren't we collecting the data? Because they don't want to admit failure. They don't want to make it easy for me to tell you what the cost to GDP of this heroic approach to covid has been."

#data #covid #STEM #PublicHealth #DataScience #epidemiology #auspol

https://adsei.org/podcast/covid-data-why-arent-we-collecting-it-anymore/

Can you answer this week's interview question? 🤔

What is the F1-score and when would you choose it over other metrics?

Leave your answer below! 👇

Get the answer next week here: https//dramsch.net/newsletter

#DataScience #MachineLearning #Career #LateToTheParty

Fedi.Garden 🌱
2 weeks ago

Datasci.social is a server for researchers & practitioners in human-centric data science, broadly defined, like network science, computational social science, geospatial data science:

:Fediverse: https://datasci.social

For more info see their About page at https://datasci.social/about or ask their admin @mszll

#FeaturedServer #DataScience #DataSci #NetworkScience #Networks #ComputationalSocialScience #SocialScience #DataViz #Geospatial #GeospatialDataScience #Data #Science #Academic #Fediverse

ResearchBuzz
2 weeks ago

#WHO #PublicHealth #OpenData #BigData #DataScience

'Starting with the data underlying WHO's annual World Health Statistics report, the new website reimagines the indicator page – the most representative level of data presentation – with consistent, expressive and accessible visualization, while also presenting metadata to promote ease of accessibility, reference and use.'

https://www.who.int/news-room/feature-stories/detail/who-releases-data.who.int

R-Podcast (Eric) :pci:
2 weeks ago

Episode 123 of the @rstats @rweekly Highlights Podcast blends nicely with the R community! https://podverse.fm/episode/DmcMBmplP

🕸 HTTP Testing with R @maelle @RConsortium
💹 Introducing {ggblend} @mjskay
📆 Handling dates in R & Excel @AbrahamsAmieroh@twitter.com @jumpingrivers

Happy with your current podcast app but want to send a boost? You can do that directly on the Podcast Index! Find us at https://podcastindex.org/podcast/1062040

h/t @mike_thomas @batool664@twitter.com 🙏

#rstats #datascience #podcasting2.0 #v4v

A whole new way to manipulate images!
Just "Drag your GAN"! 😍

This groundbreaking research paper introduces an interactive, point-based manipulation technique for GAN-generated images that's sure to blow your mind! 🚀

This looks so fun!

Join my newsletter, where I share these projects every week! https://dramsch.net/newsletter
🧠: https://vcai.mpi-inf.mpg.de/projects/DragGAN/
📝: https://arxiv.org/abs/2305.10973
💻: https://github.com/XingangPan/DragGAN (Coming Soon)

#machinelearning #datascience #python #deeplearning #career #tech

Video of Drag GAN, changing a lion image by just clicking and dragging
Manlio De Domenico
2 weeks ago

I have a figure made with the R package 'maps'.

The paper with that figure got accepted by a journal requiring a CCBy4 license.

What's the license of figures that we produce by using that R package? The code of the package is released under GPL3, but not clear to me if that applies also to the figures produced by me with that package.

#RStat #datascience #license #copyright

Ideas? @franco_vazza @tiago

Ianhopkinson
2 weeks ago

My line manager is dead keen for me to use #Jupyter notebooks convinced they will solve any problem. I've used them before, I don't like them! #Python #DataScience

My main complaint is that execution order/state is not clear and Notebooks aren't particularly git friendly.

Kedro
2 weeks ago

kedro-datasets 1.3.0 is out! 🔶

kedro-datasets is a separate PyPI package where Kedro datasets live. ⚠️ Notice that `kedro.extras.datasets` is deprecated and will be removed in Kedro 0.19, so install the new package now!

```
pip install "kedro-datasets==1.3.0"
```

Highlights: pandas 2.0 and SQLAlchemy 2.0 support, new `metadata` attribute, new ManagedTableDataSet for managed delta tables on Databricks, Polars 0.17 support, and more!

#kedro #datascience #python #pydata

Jack
2 weeks ago

We don't have a Mastodon account (yet) but I think it's worth sharing that Open Data Scotland is hosting its inaugural hack weekend this coming Friday.

Come along if you want to find out more about our community and get involved!

📅 When: Friday 26th May, 18:00 - 22:00 (you don't have to stay for the whole time!)

🔗 Sign up now: https://opencollective.com/opendata_scot/events/hack-evening-0ca24921
#OpenData #Hackathon #Open #Data #Scotland #Event #DataScience

A poster for Open Data Scotland's Hack Evening, captioned:
"Hack Evening
Open Data Scotland's first hack evening
26th May 2023
18:00 - 22:00"
Professor Kerstin Sailer
2 weeks ago

Wow - what a paper on the path(s) from data to insights!
"Variability in research outcomes between researchers can occur even under rigid adherence to the scientific method, high ethical standards, and state-of-the-art approaches to maximizing reproducibility."
https://www.pnas.org/doi/10.1073/pnas.2203150119
#DataScience #Science #reproducibility

Michael Szell
2 weeks ago

This week https://datasci.social has turned 6 months old 🥳

We are 133 people now (68 active), with increased growth in the past few weeks. Our server hosts a community for #DataScience, broadly defined. See below our place in the universe of servers.

Our operation is financed via donations, we are grateful for support: https://community.datasci.social/docs/support/

datasci.social is marked with a yellow circle in the middle of the log-log scatterplot that shows users vs posts of all mastodon servers.
Chart showing growth of users in blue and number of posts in yellow. Users are now at 133, posts above 2300.
Admin dashboard showing increased users and activity in the last month.
Kedro
2 weeks ago

Are you in Amsterdam and want to learn how to effectively create and run maintainable data pipelines in the cloud? 🔶

Join Xebia | Data for an in-person Kedro Code Breakfast at their offices next Tuesday, May 23rd 📅 Seats are limited!

https://events.xebia.com/code-breakfast-kedro/?utm_source=mastodon

#kedro #python #azure #datascience #datapipelines

Event banner "Kedro Code Breakfast"
Code The City
2 weeks ago

Just four weeks unti our Health Hackathon on 17-18 June! Our number of attendees signed up has more than doubled in the last week. And even more challenges are being added each week.

https://codethecity.org/ctc29/

Thanks to NHS Grampian and University of Aberdeen for sponsoring the event and to Robert Gordon University for participating too. Also thanks to ONE Tech Hub for hosting us! See you there!

#health #hackathon #coding #datascience #AI #voicecontrol #LLM #appdevelopment

"Do not sit and be patient with a guy who just nuked 14 Earths to sell more cheeseburgers!"

Fast food and credit cards are neither economical nor sustainable; these are Zuck's biggest advertizers.

https://www.statista.com/statistics/1250606/facebook-advertisers/

#datascience

Top-spending advertizers generating "revenue" for Zuckerberg are:  

Mike Bloomberg
Donald Trump
Joe Biden
Political PACs l
US Census Bureau 
Instagram

https://www.statista.com/statistics/1250606/facebook-advertisers/
Andy Baker
3 weeks ago

I published my first academic paper thirty years ago.

Overnight, a researcher contacted me to ask if I had the data for the main figure in the paper so that they could reproduce it.

Could I help? The data was from the time before Windows, 5.25-inch floppy discs, and graphs sent from a Unix environment directly to a printer for redrawing for publication by a cartographer.

You bet! In my archive, there was the original input data and code (Fortran77). And now shared.

#academia #publishing #science #research #HigherEd #caves #climate #academicchatter #Fortran #datascience

The front page of the paper published in 1993
Kedro
3 weeks ago

New blog post: A Polars exploration into Kedro 🐻‍❄️

Ahead of @astrojuanlu's workshop on PyCon Lithuania this week, he described in this blog post what's the current status of Polars support in Kedro, how can you use it instead of pandas, and what can you expect in the future.

https://kedro.org/blog/a-polars-exploration-into-kedro

@ritchie46, creator of Polars, will be at the event as well talking about the future of the project.

#kedro #polars #pyconlt #python #datascience #pydata

Juan Luis
3 weeks ago

Rescuing a series of blog posts I wrote at `{{ previous_job }}` for my @thepracticaldev blog. Starting with the first one:

"Demystifying Apache Arrow"

https://dev.to/astrojuanlu/demystifying-apache-arrow-5b0a

#DEVCommunity #python #arrow #datascience #dataframes #pydata

🐍 Python Pulse 🐍 is going live in 10 minutes with @BajoranEngineer and Jeffrey Mew. We're talking about how the new Data Wrangler extension is revolutionizaing the way you clean your #data into your code editor ▶️ https://youtube.com/live/5EeBSNr0x3Y @pythonvscode #DataScience #pythoncode https://t.co/hCHkw4C5Hh

:sys_twitter: https://twitter.com/code/status/1657080704285523968

Media source: https://pbs.twimg.com/media/Fv8iHlLWYA4zaXP?format=jpg&name=orig
Ianhopkinson
3 weeks ago

I'm reading Masterminds of Programming - which includes a chapter about APL, so I thought I'd give it a go here: https://tryapl.org/

It is completely bonkers! Reminiscent of Haskell but with the added obfuscation of using its on set of weird special characters.

For example, ⌹ invokes the matrix inversion function

#DataScience #APL #Bookstodon #CompSci

Jess Butler
3 weeks ago

Help! Looking for a vaguely-remembered toot from the past week

It was a screenshot of a document from Google, on the subject of training ML/AI, discussing how smaller, high-quality datasets were better

Ring any bells?

#AI #ML #DataScience #Google

Juan Luis
4 weeks ago

plotnine 0.12.1 is out! 🎨

plotnine is an implementation of the Grammar of Graphics in Python. In other words: "ggplot2 meets Python".

This version has a new layout manager (so it's easier to avoid overlapping objects), a new `save_helper()` method gives you access to the matplotlib figure, and much more.

Install it with `pip install "plotnine==0.12.1"`

Complete release notes: https://github.com/has2k1/plotnine/releases/tag/v0.12.1

#python #pydata #dataviz #plotnine #ggplot2 #datascience

Rust Daily
4 weeks ago

A practical handbook on data engineering with Rust:

https://datawithrust.com/

#rust #rustlang #datascience #data

Juan Luis
4 weeks ago

pandera 0.15.0 is out!

pandera allows you to define schemas for your DataFrames, tighten them with rules, and validate your data to prevent errors.

The new version ships support for pandas 2.0, bare data dtypes for schemas, default values, and more.

Install it with `pip install "pandera==0.15.0"`

More information https://github.com/unionai-oss/pandera/releases/tag/v0.15.0

#python #pandas #pydata #pandera #datascience

Kedro
1 month ago

Listen to this interview to Yetunde Dada and Ivan Danov, Product Director and Engineering Director for Kedro 🎙️ Thanks to Adam Kawa from GetInData | Part of Xebia for recording it!

https://open.spotify.com/episode/2fVKWEI5JG64cGesvjBcxy

#kedro #python #pydata #datascience #podcast #interview

egri-nagy
1 month ago

You run your data analysis twice and you get different results. If you are not happy with this, then #clojure might be a good option.

Immutability begets reproducibility. Language stability brings about code longevity.

Why not used by everyone? Watch this excellent talk by @kira

#datascience #data #analysis

https://youtu.be/xEvkT9YeBQU

PaquitoBernard
1 month ago

The Invisible Workload of Open Research :ablobcatknitsweats:

"It is argued that there is a high chance that without intervention, increased expectations to engage in open research practices may lead to unacceptable increases in demands on academics"

#OpenScience #openaccess #preprint #datascience #fair
#academicchatter

https://journal.trialanderror.org/pub/the-invisible-workload/release/1