Albert Wong
Trino Connector file for Apache Hudi + HMS + S3 or Apache Hudi + HMS+ Min.IO
Hudi + AWS S3
2d ago

Albert Wong
OneHouse.ai LakeView, a free product to get monitoring and insights into Apache Hudi, Iceberg…
Lakeview enhances the current observability metrics provided by the open-source Hudi project (as well as for Iceberg/Delta Lake), with context…
2d ago

Albert Wong in Data Engineer Things
How does onehouse.ai differ from other established lakehouses
Question: How does onehouse.ai differ from other established lakehouses from the likes of Databricks, Snowflake, Cloudera, AWS Data Lake…
2d ago

Albert Wong
How CelerData optimized its GTM motion to drive a 75% increase in signal-sourced pipeline
One of my proudest achievements at #CelerData / #StarRocks was contributing to the success of our community qualified leads (#CQL)…
Jun 6

Albert Wong
What you need to have spark read and write in S3 (specifically apache iceberg, apache hudi, delta…
Spark is a bit of a pain. If you want Spark to write into S3 buckets, you need two major pieces.
May 31

Albert Wong
How to register Apache Hudi open table format files into Apache Hive Metastore (HMS)
A Hudi table can directly be synced to the Hive Metastore using Hive Sync Tool and subsequently be queried by different query engines. For…
May 30

Albert Wong
New kid on the block: CedarDB
The most interesting thing in data: Umbra which took the leader’s top spot (faster query performance than DuckDB, ClickHouse, Doris /…
May 29

Albert Wong
How to use VS code java debugger and Kafka Connect inside of Docker Compose
You need to bind *:5005 and KAFKA_DEBUG=y won't do this. I found out the hard way and found this…
Apr 25

Albert Wong in Dev Genius
Streamlining Analytics: Kappa Architecture with StarRocks for Big Data
The ever-growing volume of data in today’s world demands real-time insights for effective decision making. Big data analytics has become…
Apr 5

Albert Wong in Data Engineer Things
Quickstart on Open Data Lakehouse with StarRocks + S3 + Delta Lake
Delta Lake, the open-source storage layer on top of data lakes like S3, brings reliability and structure to your data. But querying that…
Apr 3