Month: October 2015

Impala metadata not synced across all Impala Daemons when Load Balancer is enabled

Recently I have been dealing with quite a few customers who hit the same issue: Impala metadata going out of sync between Impala Daemons. The common cause had to do with the Load Balancer set up in front of the Impala Daemons and the way they ran Impala queries. So this was …

Hive query failed with error: Killing the Job. mapResourceReqt: 1638 maxContainerCapability:1200

When running a Hive query, you get the following error in the job history: This is caused by the following settings in YARN: The solution is to set the settings mentioned above so that mapreduce.map.memory.mb < yarn.nodemanager.resource.memory-mb < yarn.scheduler.maximum-allocation-mb. The problem should then be resolved.
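As a rough illustration of the check described above, the following Python sketch pulls the two numbers out of the error message quoted in the post title and tests a set of values against the ordering given in the excerpt. The function names and the sample values 1024, 2048, and 4096 are hypothetical and chosen only for illustration; nothing here reads a real cluster configuration.

```python
import re

# Matches the two numbers in the message quoted in the post title.
_ERROR_RE = re.compile(
    r"mapResourceReqt:\s*(\d+)\s+maxContainerCapability:\s*(\d+)"
)


def parse_capability_error(message):
    """Return (requested_mb, max_container_mb), or None if the text does not match."""
    match = _ERROR_RE.search(message)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def settings_are_ordered(map_mb, nodemanager_mb, max_allocation_mb):
    """Check the ordering stated in the post:
    mapreduce.map.memory.mb < yarn.nodemanager.resource.memory-mb
    < yarn.scheduler.maximum-allocation-mb
    """
    return map_mb < nodemanager_mb < max_allocation_mb


message = "Killing the Job. mapResourceReqt: 1638 maxContainerCapability:1200"
requested, capability = parse_capability_error(message)

# The job is killed because the map request is larger than the largest container.
print("request exceeds capability:", requested > capability)

# Hypothetical values that satisfy the ordering from the post.
print("ordering satisfied:", settings_are_ordered(1024, 2048, 4096))
```

A helper like this could be run against values copied from the cluster configuration before resubmitting the query.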

Kerberos connections to HiveServer2 not working cross domain

The following is the scenario of the cross-domain problem with a Kerberized cluster:

1. The cluster is within realm “DEV.EXAMPLE.COM”
2. The client is outside the cluster, in realm “EXAMPLE.COM”
3. Connecting to Impala from the client machine works
4. Connecting to HS2 from the client machine does not work and gives the following error: …