DNS issues in the Hadoop/HBase stack #12

nvtkaszpir · 2020-04-04T14:45:01Z

Looks like most of the stack pods are failing to start after pod termination because of DNS entries are not registered fast enough in headless services (depends on cloud venodr, in GKE it's up to 60s), for example:

hdfs-namenode-0 namenode java.lang.IllegalArgumentException: java.net.UnknownHostException: hdfs-namenode

This may also influence other services (such as kafka).

Fix:

set in headless services spec.publishNotReadyAddresses: true, which will enforce registration of dns hosts even if they are not ready. Ready state is based on pod readiness/liveness probes, if they pass, the given pod is added to the endpoints. In this setup enforcing publishing DNS entries which are not read is not an issue, actually this is expected because the way Hadoop stack was designed. So the DNS entries should be added, while appropriate java apps will handle the actual availability of the processes within pods.

Reference:

It's not DNS

There's no way it's DNS

It was DNS

The text was updated successfully, but these errors were encountered:

This was referenced Apr 7, 2020

Hdfs fixes #16

Merged

Hbase updates #17

Merged

[incubator/zookeeper] Cannot open channel to 1 at election address xxxxxxx-zookeeper-0.zk-zookeeper-headless.default.svc.cluster.local.:3888 helm/charts#21865

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DNS issues in the Hadoop/HBase stack #12

DNS issues in the Hadoop/HBase stack #12

nvtkaszpir commented Apr 4, 2020 •

edited

Loading

DNS issues in the Hadoop/HBase stack #12

DNS issues in the Hadoop/HBase stack #12

Comments

nvtkaszpir commented Apr 4, 2020 • edited Loading

nvtkaszpir commented Apr 4, 2020 •

edited

Loading