When I run my job on a larger dataset, lots of mappers / reducers fail, causing the whole job to crash. Here's the error I see on many mappers:

java.io.FileNotFoundException: File does not exist: /mnt/var/lib/hadoop/tmp/mapred/staging/hadoop/.staging/job_201405050818_0001/job.split
    at org.apache.hadoop.hdfs.DFSClient$DFSInputStream.openInfo(DFSClient.java:1933)
    at org.apache.hadoop.hdfs.DFSClient$DFSInputStream.<init>(DFSClient.java:1924)
    at org.apache.hadoop.hdfs.DFSClient.open(DFSClient.java:608)
    at org.apache.hadoop.hdfs.DistributedFileSystem.open(DistributedFileSystem.java:154)
    at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:429)
    at org.apache.hadoop.mapred.MapTask.getSplitDetails(MapTask.java:385)
    at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:417)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:377)
    at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1132)
    at org.apache.hadoop.mapred.Child.main(Child.java:249)

Has anybody been able to solve this problem? I see another human experiencing the same pain as me (here), sadly he could not be saved in time.

