Hadoop on Amazon EC2 : Job tracker not starting properly | ansaurus

tags:

views:

190

answers:

1

+2 Q:

Hadoop on Amazon EC2 : Job tracker not starting properly

We are running Hadoop on Amazon EC2 cluster. We start the master, slaves and attach the ebs volumes and finally waiting for hadoop jobtracker, tasktracker etc to start and we have timeout of 3600 seconds. We are noticing 50% of the time that job tracker is not able to start before the timeout. Reason being, hdfs is not initialized properly and still in safemode and job tracker is unable to start. I noticed few connectivity issues between nodes on EC2 as I tried manually pinging slaves.

Did anyone face similar issue and know how to solve this?

A:

I'm not sure, whether this issue is related to Amazon EC2. I had this problem very often too - although I had a pseudo-distributed installation on my machine.

In these cases I could turn the safemode off manually and safely.
Try this command:bin/hadoop dfsadmin -safemode leave

I think you can't do wrong here. It seems to be a buggy feature of hadoop. I used 0.18.3, what version do you run?

Peter Wippermann 2010-05-06 07:51:19

In our case, I actually went and pinged the amazon ec2 instance that hdfs was having problem connecting to and the ping failed. So, I am concerned this is an amazon issue.I am running hadoop 0.20.1

Algorist 2010-05-06 22:28:57

related questions

Hosting, deploying and running web applications in the cloud

What is a good pricing model for Windows Azure?

Can I set the expires header on all objects in an Amazon S3 bucket all at once?

Book recommendation for running ASP.NET/SQL Server on AWS

What is a "Cloud OS"?

What is the easiest way to parallelize my C# program across multiple PCs

Can someone explain the concept of an "instance-hour" as used by cloud computing providers?

How does SQL Server Licensing work on Amazon's EC2?

Windows Azure for web developers vs Amazon EC2

How best to resize images off-server

How to set up a computing cloud and how it works?

Fluffy Cloud Configurations For .NET

Deploying to Amazon EC2

Any good distributed agent/service models for .NET?

Experiences and tips for programming with and for Amazon's cloud servers/apps/tools?

Amazon - EC2 cost?

What alternatives are there to Google App Engine?

What is Cloud computing?

Query points epsilon-close to a cut plane in point cloud using the GPU.

Are off-the-cloud desktop applications dead?

How does dedicated webhosting compare to Amazon's Cloud?

Do you use Amazons Cloud services for your company?

Amazon Web Services

I'm looking for a Windows hosting provider that supports custom os images (like AMZN EC2)

What's the best way to generate a tag cloud from an array? (using h1 through h6 for sizing)