Hello! My name is Daehong Jeon
I'm Software Engineer / Data Engineer.
Blog
|
Email
|
Linkedin
|
π» Data Platform & Lakehouse Engineer
- Extensive experience in the finance and e-commerce domains, covering everything from Hadoop-based ecosystems to modern cloud-native data platforms.
- Skilled in processing tens to hundreds of terabytes of data, developing Spark-based distributed ETL pipelines, optimizing Hadoop legacy systems, and designing highly available, failover-ready, and concurrency-controlled data infrastructures.
- Experienced in designing and building Data Lakehouse architectures, operating large-scale Amazon Redshift-based Data Warehouses, and developing production-grade ETL orchestration with Apache Airflow.
- Hands-on experience in metadata management, data governance, workflow automation, platform reliability, and large-scale data platform operations.
- Designed and implemented OWL ontology-based Virtual Knowledge Graph (VKG) architectures for semantic data integration and knowledge-driven analytics.
π₯ Leadership & Management
- Experience leading data platform projects from architecture design to production operation, while coordinating stakeholders and driving technical decision-making.
- Actively involved in mentoring, knowledge sharing, technical leadership, and improving engineering productivity through automation and operational excellence.
π I love contributing to open source
- Active contributor to data engineering projects such as Apache Iceberg and Apache Kafka, including issue triage, bug investigation, PR reviews, and documentation improvements.
- Open Source Mentoring (2025.05~) β Engaged in fostering a healthy and sustainable open source culture in Korea.
π― My Goals
- To grow as a technology leader who continuously adopts and disseminates emerging technologies while building strong engineering cultures.
- To design and operate scalable, reliable, and maintainable data platforms that support high-volume, mission-critical workloads.
- To contribute to both business success and the open source ecosystem through technical excellence, leadership, and knowledge sharing.
| project | summary | Type | link | date |
|---|---|---|---|---|
| Fixed Markdown rendering in Row-level Deletes section (bullets, duplicates, line breaks). | Docs | PR | 25.09 | |
Added missing super.validate() call in OAuth2TokenResponse.validate(). |
Bug Fix, Improvement | PR | 25.09 | |
Fixed incorrect field assignments in TestModelMetaService tests. |
Bug Fix, Test | PR | 25.08 | |
| Added docs for Table Maintenance in Flink (file compaction, orphan removal, snapshot expiration). | Docs | PR | 25.08 | |
Fixed early return in GroupMetaService.updateGroup that ignored non-role updates. |
Bug Fix, Test | PR | 25.08 | |
Fixed NullPointerException in EntityCombinedFileset with null-safe hiddenProperties. |
Bug Fix, Test | PR | 25.08 | |
Fixed null-safe handling of managed property in CreateFileset.java. |
Bug Fix, Test | PR | 25.08 | |
Added request.validate() in PartitionOperations.java for proper error handling. |
Improvement | PR | 25.08 | |
| Migrated Flink catalog tests from JUnit4 to JUnit5. | Improvement(Core) | PR | 25.05 | |
| Backported JUnit5 migration to Iceberg 1.19 and 1.20. | Improvement(Core) | PR | 25.05 | |
| Updated test code and docs for Kafka retry topic template bean name. | Improvement, Docs | PR | 24.10 |




