- San Francisco Bay Area
- https://www.linkedin.com/in/dongjoon
- @dongjoonhyun
Highlights
Stars
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Easy to maintain open source documentation websites.
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
A massively parallel, optimal functional runtime in Rust
A massively parallel, high-level programming language
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Java SDK for building Kubernetes Operators
Creates CycloneDX Software Bill of Materials (SBOM) from Maven projects
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Apache Spark - A unified analytics engine for large-scale data processing
Legacy mirror of Darwin Kernel. Replaced by https://github.com/apple-oss-distributions/xnu
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
https://openjdk.org/projects/jdk/17 released 2021-09-14
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
FUSE-based file system backed by Amazon S3
lzbench is an in-memory benchmark of open-source LZ77/LZSS/LZMA compressors