- Berkeley
-
01:14
(UTC -08:00) - https://julien.ledem.net/
- @J_
- @julien.ledem.net
- https://sympathetic.ink/
Highlights
- Pro
Stars
Astra is a structured log search and analytics engine developed by Slack and Salesforce
An example how to embed YouTube videos in your GitHub Pages
Run your dbt Core projects as Apache Airflow DAGs and Task Groups with a few lines of code
BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)
An Open Standard for lineage metadata collection
An OIDC provider integrator. Choose your social providers without needing to write code.
Guideline to extract table lineage info in OpenLineage format from access history view
Collect, aggregate, and visualize a data ecosystem's metadata
Chrome extensions. Currently mostly for dealing with my tab problem.
Simple metrics pipeline to track the growth of the OpenLineage community
Upserts, Deletes And Incremental Processing on Big Data.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
A python-ish pure and total functional programming language
Iceberg is a table format for large, slow-moving tabular data
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
A collection of Apache Parquet add-on modules
The official home of the Presto distributed SQL query engine for big data
An example type provider, which uses type macros from Scala macro paradise