HPE Ezmeral Software platform
1847776 Members
3210 Online
104021 Solutions
New Discussion

Error starting ResourceManager

 
SOLVED
Go to solution
Satish_Infa
Advisor

Error starting ResourceManager

2024-11-25 09:43:53,002 ERROR org.apache.hadoop.yarn.server.resourcemanager.ResourceManager: Error starting ResourceManager
org.apache.hadoop.service.ServiceStateException: org.apache.hadoop.thirdparty.protobuf.InvalidProtocolBufferException: Protocol message contained an invalid tag (zero).
at org.apache.hadoop.service.ServiceStateException.convert(ServiceStateException.java:105)
at org.apache.hadoop.service.AbstractService.start(AbstractService.java:203)
at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.startActiveServices(ResourceManager.java:1431)
at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$1.run(ResourceManager.java:1472)
at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$1.run(ResourceManager.java:1468)
at java.base/java.security.AccessController.doPrivileged(Native Method)
at java.base/javax.security.auth.Subject.doAs(Subject.java:423)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:2013)
at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.transitionToActive(ResourceManager.java:1468)
at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.serviceStart(ResourceManager.java:1521)
at org.apache.hadoop.service.AbstractService.start(AbstractService.java:194)
at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.main(ResourceManager.java:1731)

 

Steps tried to troubleshoot but didn't help

https://support.hpe.com/hpesc/public/docDisplay?docId=sf000079477en_us&docLocale=en_US

1) Take the backup of Epoch file.

      hadoop fs -get  /var/mapr/cluster/yarn/rm/system/FSRMStateRoot/EpochNode /tmp/

2) Remove the EpochFile from state store.

      hadoop fs -rm /var/mapr/cluster/yarn/rm/system/FSRMStateRoot/EpochNode

 3) Try Restart the Resource Manager.

 

2 REPLIES 2
hiteshingole
HPE Pro
Solution

Re: Error starting ResourceManager

There seems ot be problem with the stateStore. If the state of the running applications(Prior to failure ) are not important can you please try removing the satestore completly and share your obsevations.

 

-stop RM
-make a statestore backup
hadoop fs -mv /var/mapr/cluster/yarn/rm/system/FSRMStateRoot/* /FSRMStateRoot_backup/
-remove the directory
hadoop fs -rmr /var/mapr/cluster/yarn/rm/system/FSRMStateRoot/
-start RM

 

I'm an HPE employee.
[Any personal opinions expressed are mine, and not official statements on behalf of Hewlett Packard Enterprise]
Accept or Kudo
support_s
System Recommended

Query: Error starting ResourceManager

Hello,

 

Let us know if you were able to resolve the issue.

 

If you have no further query, and you are satisfied with the answer then kindly mark the topic as Solved so that it is helpful for all community members.

 

Please click on "Thumbs Up/Kudo" icon to give a "Kudo".

 

Thank you for being a HPE valuable community member.


Accept or Kudo