Month: <span>April 2019</span>

Month: April 2019

Big Compressed File Will Affect Query Performance for Impala

As we know, Hadoop/HDFS/MapReduce/Impala is designed to store and process large amount of data, in terms of TBs or PBs. And we also know that having too many small files will hurt query performance, because NameNode needs to store millions of metadata to hold the information about files being stored …

Loading

Australian Citizenship

After studying, working and living in Melbourne, Australia for almost 20 years, today marks my first day as an Australian Citizen. I arrived in Melbourne on 15th of January, 2000, as a high school student, at the age of 17, 3 months before my 18th birthday. After 4 years of …

Loading

My new Snowflake Blog is now live. I will not be updating this blog anymore but will continue with new contents in the Snowflake world!