julienledem

Highlights

  • Pro

Organizations

@apache @Parquet @MarquezProject @OpenLineage


Apache DataFusion SQL Query Engine

Rust 6,268 1,184 Updated Nov 10, 2024

Astra is a structured log search and analytics engine developed by Slack and Salesforce

Java 211 29 Updated Nov 9, 2024

perfect programming language

11,458 366 Updated Nov 8, 2024

An example of how to embed YouTube videos in your GitHub Pages

HTML 26 39 Updated Sep 14, 2022

Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code

Python 653 168 Updated Nov 7, 2024

BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)

C++ 225 19 Updated May 7, 2024

An Open Standard for lineage metadata collection

Java 1,762 306 Updated Nov 10, 2024

An OIDC provider integrator. Choose your social providers without needing to write code.

Java 121 18 Updated Dec 31, 2023

Making DAG construction easier

Python 242 11 Updated Nov 8, 2024

Guideline to extract table lineage info in OpenLineage format from access history view

10 5 Updated May 11, 2023

Collect, aggregate, and visualize a data ecosystem's metadata

Java 1,774 318 Updated Nov 10, 2024

Egeria core

Java 808 261 Updated Nov 9, 2024

Chrome extensions. Currently mostly for dealing with my tab problem.

JavaScript 4 1 Updated May 2, 2024

Simple metrics pipeline to track the growth of the OpenLineage community

Python 2 1 Updated Dec 15, 2021

🐯 visx | visualization components

TypeScript 19,495 716 Updated Nov 7, 2024

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,412 2,425 Updated Nov 8, 2024

Airflow support for Marquez

Python 32 13 Updated Dec 11, 2020

Marquez Web UI

TypeScript 22 6 Updated Nov 13, 2020

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,581 1,700 Updated Nov 9, 2024

A python-ish pure and total functional programming language

Scala 225 11 Updated Nov 9, 2024

Iceberg is a table format for large, slow-moving tabular data

Java 478 60 Updated Apr 10, 2023

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 14,552 3,537 Updated Nov 9, 2024

Apache Parquet

442 192 Updated May 7, 2024

Apache Parquet Format

Thrift 1,801 432 Updated Nov 8, 2024

GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.

Scala 1,433 433 Updated Nov 8, 2024

Apache Calcite

Java 4,601 2,371 Updated Nov 8, 2024

A collection of Apache Parquet add-on modules

Scala 29 8 Updated Oct 20, 2024

The official home of the Presto distributed SQL query engine for big data

Java 16,039 5,375 Updated Nov 9, 2024

Apache Parquet Java

Java 2,633 1,408 Updated Nov 10, 2024

An example type provider, which uses type macros from Scala macro paradise

Scala 10 2 Updated Dec 25, 2012