Masthash

#dataframe

Андрусь | Andrus
1 month ago

Just posted a fairly big release of DFLib - a Java #DataFrame library.

The core structures (DataFrame, Series) have a lot of methods, so it is sometimes hard to find your way around. The release introduces more logical naming and a simple rule: "no operations on a DataFrame should return a Series". This results in a more "flowable" and intuitive API.

Also, there are a few new Exp functions and bug fixes.

https://github.com/nhl/dflib/blob/master/RELEASE-NOTES.md

Statistics Globe
2 months ago

ChatGPT Coding - Creating a Data Frame in R!

Check out this helpful R tutorial for initializing a dataframe with ChatGPT!

#rstats #RStudio #ChatGPT #MachineLearning #dataframe

Statistics Globe
2 months ago

ChatGPT Coding - Removing Bottom Row from Python Data Frame!

Check out this useful ChatGPT Python tutorial for removing a bottom row from a dataframe!

#python #pythoncode #ChatGPT #AI #dataframe #openai
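
The linked tutorial is not reproduced here. As a minimal sketch, assuming a pandas DataFrame with made-up data, the bottom row can be removed with a positional slice or by dropping the last index label:

```python
import pandas as pd

# Made-up example data, for illustration only.
df = pd.DataFrame({"name": ["a", "b", "c"], "value": [1, 2, 3]})

# Positional slice: keep every row except the last one.
trimmed = df.iloc[:-1]

# Label-based alternative: drop the last index label.
dropped = df.drop(df.index[-1])

print(trimmed)
```

Both forms return a new DataFrame and leave `df` unchanged.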

Statistics Globe
2 months ago

ChatGPT Coding - Creating a Data Frame in Python!

Check out this useful Python tutorial for initializing a new dataframe with ChatGPT!

#python #pythoncode #ChatGPT #AI #dataframe
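
The tutorial itself is not reproduced here. A minimal sketch of three common ways to initialize a pandas DataFrame, using placeholder column names and values:

```python
import pandas as pd

# From a dict of columns (placeholder names and values).
from_columns = pd.DataFrame({"label": ["x", "y"], "amount": [1.5, 2.5]})

# From a list of row records; the result is the same table.
from_records = pd.DataFrame(
    [
        {"label": "x", "amount": 1.5},
        {"label": "y", "amount": 2.5},
    ]
)

# An empty frame with known column names, to be filled later.
empty = pd.DataFrame(columns=["label", "amount"])

print(from_columns)
```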

Statistics Globe
2 months ago

No Mess in R Data Frames!

How to clean the column names of a data frame in the R programming language.

#rstats #dataframe #datacleaning #programming

Daniel Huppmann
4 months ago

To review the (draft) update of national #SSP projections for #GDP and #population, you can...

1. View the data via our interactive #ScenarioExplorer hosted by the #IIASA #ScenarioServices team
2. Download the full dataset as xlsx
3. Access the data via our #Python or #Rstats API

And with our #opensource package #pyam_iamc, it's as easy as the snippet below...
... to get something that's just like a #pandas #DataFrame - but better!

#ReadTheDocs at https://pyam-iamc.readthedocs.io

Python code to query the IIASA SSP-Review database for analysis & data visualization:

import pyam
df = pyam.read_iiasa("ssp", model=..., scenario=...)

eric_ma
4 months ago

There are at least a couple of options for working with MultiIndex data that can be particularly helpful for slicing or filtering:

#python #pandas #datascience #dataframe

- pandas.DataFrame.xs
- pandas.IndexSlice

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.xs.html

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.IndexSlice.html?highlight=indexslice#pandas-indexslice
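
A minimal sketch of both options on a small two-level index. The level names `region` and `year` and all values are made up for illustration:

```python
import pandas as pd

# Made-up two-level index: region x year.
index = pd.MultiIndex.from_product(
    [["A", "B"], [2020, 2021]], names=["region", "year"]
)
df = pd.DataFrame({"value": [1, 2, 3, 4]}, index=index)

# DataFrame.xs takes a cross-section at one level and drops that level.
by_year = df.xs(2021, level="year")

# pandas.IndexSlice builds slicers for .loc and keeps all levels.
idx = pd.IndexSlice
subset = df.loc[idx[:, 2021], :]
region_a = df.loc[idx["A", :], :]

print(by_year)
print(subset)
```

Note that `xs` returns a frame indexed by the remaining level only, while the `IndexSlice` form keeps the full MultiIndex. Slicing with `.loc` generally expects the index to be sorted, which `from_product` on sorted inputs provides here.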

Statistics Globe
4 months ago

Don't Make This MISTAKE with R Data Classes!!!

How to convert factors to the numeric data type without changing the factor level numbers in the R programming language.

#rstats #datatypes #programming #dataframe

Statistics Globe
4 months ago

Great TRICK for DATAFRAMES in Python!!!

How to compare two pandas DataFrames and find differences between rows in the Python programming language.

#python #dataframe #programming #programmer
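
The video's approach is not reproduced here. One common way to find rows that differ between two pandas DataFrames, sketched with made-up data, is an outer merge with `indicator=True`:

```python
import pandas as pd

# Made-up frames that share the same columns.
left = pd.DataFrame({"id": [1, 2, 3], "score": [10, 20, 30]})
right = pd.DataFrame({"id": [1, 2, 4], "score": [10, 25, 40]})

# With no "on" argument, merge joins on all shared columns, so a row
# only counts as matching when every value agrees.
merged = left.merge(right, how="outer", indicator=True)

# The "_merge" column records where each row was found.
only_left = merged[merged["_merge"] == "left_only"].drop(columns="_merge")
only_right = merged[merged["_merge"] == "right_only"].drop(columns="_merge")

print(only_left)
print(only_right)
```

For two frames with identical row and column labels, `DataFrame.compare` is an alternative that reports cell-level differences.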

Statistics Globe
5 months ago

Python Indices Can Really Be Used Like This???

This video explains how to insert a new row at a certain index position of a pandas DataFrame. We have to slice our data!!!

#python #pythonprogramming #index #dataframe
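
A minimal sketch of the slicing idea, assuming a default integer index. The helper name `insert_row` is hypothetical and not taken from the video:

```python
import pandas as pd


def insert_row(frame, position, row):
    """Return a new frame with `row` (a dict) inserted at `position`."""
    new_row = pd.DataFrame([row])
    # Slice the frame into the part before and the part after the
    # position, then stitch the pieces together with a fresh index.
    return pd.concat(
        [frame.iloc[:position], new_row, frame.iloc[position:]],
        ignore_index=True,
    )


df = pd.DataFrame({"x": [1, 2, 4]})
result = insert_row(df, 2, {"x": 3})
print(result)
```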

Statistics Globe
5 months ago

MERGE Two Data Sets in 55 SECONDS in R!

How to join multiple data frames using the merge() function in the R programming language.

#rstats #dataframe #programming #rstudio

Michael Simons
6 months ago

Not much data yet, but I have kinda finished my static #Jupyter based #Photovoltaics dashboard, built around a pink database approach on #DuckDB, using their #Python #Dataframe abstraction: http://simons.ac/pv/

Sources: https://github.com/michael-simons/pv/blob/main/notebooks/Photovoltaik%20%7C%20Familie%20Simons%2C%20Aachen.ipynb

Stefan Zitz
11 months ago

Today I got stuck in automation hell 🔥. I wanted a script that runs #simulations, reads the results, pushes them into a #DataFrame and writes that frame to a #CSV. While that sounds reasonable, I quickly ran into the fact that I hadn't used a coherent naming scheme for my simulations 😩

osunderdog
1 year ago

Wrote up a benchmark of the difference between #python #pandas #dataframe `iloc` and `to_dict`

https://www.wafermovement.com/2022/dataframeiloc/
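
The linked write-up's exact setup and results are not reproduced here. As a rough sketch of how such a comparison could be timed, assuming a row-by-row loop over made-up data (the function names are hypothetical):

```python
import timeit

import pandas as pd

# Made-up data: two integer columns, 500 rows.
df = pd.DataFrame({"a": range(500), "b": range(500)})


def sum_with_iloc(frame):
    # Fetch each row as a Series by position.
    total = 0
    for i in range(len(frame)):
        row = frame.iloc[i]
        total += row["a"] + row["b"]
    return total


def sum_with_to_dict(frame):
    # Convert once to a list of plain dicts, then iterate over those.
    total = 0
    for row in frame.to_dict(orient="records"):
        total += row["a"] + row["b"]
    return total


iloc_seconds = timeit.timeit(lambda: sum_with_iloc(df), number=3)
dict_seconds = timeit.timeit(lambda: sum_with_to_dict(df), number=3)
print(f"iloc: {iloc_seconds:.4f}s, to_dict: {dict_seconds:.4f}s")
```

Timings depend on the machine, the pandas version and the shape of the data, so treat the printed numbers as indicative only.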

Juan Luis
1 year ago

In one hour I will be giving a talk about #Polars, my new favourite #Python #dataframe library! Join me at #PyDataGlobal:

https://pydata.org/global2022

Sundar R :julia:
1 year ago

#JuliaBeginners #JuliaTipOfTheDay
2022-11-26

When you see `foo.bar` in your #JuliaLang code, Julia sees that as the call `getproperty(foo, :bar)`.

Let's say you load data into a #DataFrame `df`. When you access `df.Species`, that calls `getproperty(df, :Species)`.
DataFrames.jl has a matching method for that call, which returns `df[!, :Species]` i.e. makes it a column access.

You can do that for your custom types too! (Useful to understand "fields" before that, a post on those is upcoming.)

Veit Schiele
2 years ago

RT @psycopg@twitter.com
Hey, #DataScience there!
I want to improve communication between #Pandas #dataframe and #PostgreSQL, making it faster and easier to use!
What is the typical use case? What data types are dataframes usually stored as, in PostgreSQL?
Thank you!

heise+ | Data analysis with Python: First steps with Pandas – analyzing football data

Pandas is the standard library for anyone who wants to process data with Python. It also makes it easy to analyze the first Bundesliga season.

linuxwebdevelopment.com
2 years ago

A new article is up.
How to filter for a specific value in a dataframe in pandas #pandas #DataScience #dataframe
https://linuxwebdevelopment.com/how-to-filter-for-a-specific-value-in-a-dataframe-in-pandas/
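
The article's own examples are not reproduced here. A minimal sketch of filtering a pandas DataFrame for a specific value, using made-up data:

```python
import pandas as pd

# Made-up example data.
df = pd.DataFrame({"team": ["red", "blue", "red"], "points": [3, 1, 0]})

# Boolean mask: keep rows where the column equals the value.
red_rows = df[df["team"] == "red"]

# The same filter written as a query string.
via_query = df.query("team == 'red'")

# isin handles several accepted values at once.
red_or_blue = df[df["team"].isin(["red", "blue"])]

print(red_rows)
```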

Python Daddy
3 years ago

Here's a great #Python #DataScience tutorial: https://www.pythondaddy.com/python/data-manipulation-in-python-pandas/
You will learn a lot about different techniques to carry out data manipulation with the excellent Python package called Pandas.

#Programming #machinelearning #data #bigdata #dataframe #tutorial #ai #ml #artificialintelligence #science

Nils Reichert
4 years ago

Useful overview of how to add #rows (and columns) to a #pandas #dataframe: https://kanoki.org/2019/08/03/add-new-rows-and-columns-to-pandas-dataframe/ Includes "Insert row at specific Index Position", which I didn't find easily elsewhere. Tip: when using df.iterrows() to loop over rows and insert new ones under specific conditions, don't re-sort or reset the index after each insertion; do it once after the loop has ended. (; #python
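
One way to follow that tip, sketched under assumptions: collect the pieces while looping and rebuild the index a single time at the end. The data and the insertion condition (multiples of 5) are made up for illustration and are not from the linked article:

```python
import pandas as pd

df = pd.DataFrame({"x": [1, 5, 6, 10]})

pieces = []
for label, row in df.iterrows():
    # Keep the original row as a one-row frame.
    pieces.append(row.to_frame().T)
    # Made-up condition: add a marker row after every multiple of 5.
    if row["x"] % 5 == 0:
        pieces.append(pd.DataFrame({"x": [row["x"] * 100]}))

# Concatenate once, then reset the index once, after the loop.
result = pd.concat(pieces).reset_index(drop=True)
print(result)
```

Resetting inside the loop would redo the same work on every insertion and shift labels while they are still being iterated over.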

The Pharo Project
5 years ago

RT @oleks_lviv@twitter.com #Pharo #DataFrame just reached 301 green unit tests and 90% code coverage 🥳 Soon we will release DataFrame v2.0 accompanied with a dedicated booklet
https://t.co/y4Dy7zBrBX
@pharoproject@twitter.com @poly_math_org@twitter.com

aRubes 🎗
5 years ago

Rule number 1 of #r : never use #tibble or #dataframe if you don't have to. 🤐

The Pharo Project
6 years ago

RT @oleks_lviv@twitter.com Celebrating 200 commits to #DataFrame repository!
@poly_math_org@twitter.com #Pharo
https://t.co/y4Dy7zBrBX