Skip to content

nabin2004/pydplyr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

33 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ pydplyr

Simplicity and conciseness of R,
with the blazing speed of Rust.

Chainable โ€ข Composable โ€ข Declarative โ€ข Semantic โ€ข Agent-Ready


๐ŸŒŸ TL;DR

pydplyr is a modern, expressive, and chainable data manipulation library built for humans and machines. Think:

  • ๐Ÿง  Like dplyr in R
  • โšก๏ธ With Python's ecosystem
  • ๐Ÿš€ On a path to a Rust-powered backend
  • ๐Ÿค– Agent-ready for autonomous systems

Whether you're a data scientist, a developer building intelligent systems, or a machine ๐Ÿค– looking for clarityโ€”this library speaks your language.


โœจ Key Features

  • โœ… Composable Verbs โ€“ clear, expressive syntax for fast prototyping & serious work
  • ๐Ÿงฌ Chainable API โ€“ minimal boilerplate, max readability
  • ๐ŸŽจ Grammar of Graphics โ€“ familiar design for layered visualizations (Coming Soon!)
  • ๐Ÿ” Simplified Regex โ€“ harness RegEx without the brain melt
  • ๐Ÿ”— Agent-First Thinking โ€“ semantically rich, logic-oriented operations
  • ๐Ÿฆ€ Rust Ambition โ€“ future versions will compile to Rust for blazing performance

๐Ÿ› ๏ธ Verbs You Can Use Today

Each verb is purpose-built and plays well with others. Start small, scale infinitely.

Verb Description
arrange() Sort your data
select() Pick columns
filter() Subset rows
mutate() Create or modify columns
summarize() Aggregate and reduce
group_by() Enable grouped operations
distinct() Drop duplicate rows

๐Ÿ” Sample Chain

from pydplyr import *

result = (
    Panel(df)
    .arrange(desc("score"))
    .filter("score > 80")
    .mutate(score_plus_age="score + age")
    .select("name", "score_plus_age")
    .collect()
)

Intuitive, readable, chainable. One thought per line.


๐Ÿ“ˆ Grammar of Graphics (WIP)

Just like in ggplot2, our graphics philosophy follows this layered system:

  • Data โ€“ the DataFrame
  • Aesthetics โ€“ x/y mappings, color, shape
  • Geoms โ€“ bar, point, line, etc.
  • Stats โ€“ transformations like count or smooth
  • Facets โ€“ split plots by category
  • Coords โ€“ coordinate systems (polar, cartesian, etc.)
  • Themes โ€“ polish for publication or dashboard

๐Ÿ“Š Visuals should tell, not yell.


๐Ÿ”ก Simplified RegEx

We believe RegEx shouldn't be a dark art.

Panel(df).filter_col("email", pattern=".*@example.com")

You get the full power of re, simplified into expressive helpers for real-world usage.


๐Ÿ”ฎ Vision

The future of data is semantic, composable, and intelligent.

pydplyr is being designed with agentic AI frameworks in mind โ€” where the code can be read and written by both humans and agents.

Whether itโ€™s embedded in LLM-based agents or running as the logic core of autonomous data pipelines, pydplyr is made to be interpretable, traceable, and chainable.


๐Ÿงช Roadmap

  • Core verbs (arrange, select, filter, mutate, summarize, distinct)
  • Grouped operations
  • Grammar of Graphics module
  • Rust backend (via pyo3 or polars)
  • Natural-language Regex builder
  • LLM prompt-to-code interface
  • Optional async API for distributed computing
  • Plugin system for custom verbs and visual geoms

๐Ÿ“ฆ Installation

pip install pydplyr

๐Ÿค Contributing

We welcome contributors who care about:

  • elegant APIs ๐Ÿงผ
  • expressive code ๐Ÿ’ฌ
  • performance ๐Ÿ”ฅ
  • semantic richness ๐ŸŒ
  • and dreaming big ๐Ÿ’ก

To get started, clone the repo and check the CONTRIBUTING.md guidelines.


๐Ÿ’ฌ Community

Got ideas? Found bugs? Want to build the future of data science?

  • Open an issue
  • Start a discussion
  • Or just drop by with a star โญ๏ธ

๐Ÿง  Quote to Code

"The art of data science is not in the numbers โ€” itโ€™s in the story they tell, and the tools that let them speak."

โ€”

โœจ pydplyr: Built to Think. Built to Chain. Built for Tomorrow.

About

pydplyr : Simplicity and conciseness of dplyr.

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors