Questions tagged [mongodb-hadoop]
29 questions
                    
                    3
                    
            votes
                
                1 answer
            
        How to delete documents(records) with Mongo-Hadoop connector for Spark
I am using Mongo-Hadoop connector to work with Spark and MongoDB.I want to delete the documents in an RDD from the MongoDB,looks there is a  MongoUpdateWritable to support document update. Is there way to do deletion with Mongo-Hadoop…
         
    
    
        Tom
        
- 5,848
- 12
- 44
- 104
                    3
                    
            votes
                
                1 answer
            
        Save MongoDB data to parquet file format using Apache Spark
I am a newbie with Apache spark as well with Scala programming language.
What I am trying to achieve is to extract the data from my local mongoDB database for then to save it in a parquet format using Apache Spark with the hadoop-connector
This is…
         
    
    
        Nahum Fabián Huezo
        
- 41
- 1
- 5
                    2
                    
            votes
                
                1 answer
            
        Spark Task not Serializable Hadoop-MongoDB-Connector Enron
I am trying to run the EnronMail example of Hadoop-MongoDB Connector for Spark.
Therefore I am using the java code example from…
         
    
    
        Ulrich Zendler
        
- 141
- 1
- 10
                    2
                    
            votes
                
                1 answer
            
        "ERROR 6000, Output location validation failed" using PIG MongoDB-Hadoop Connector on EMR
I get an "output location validation failed" exception in my pig script on EMR.
It fails when saving data back S3.
I use this simple script to narrow the problem:
REGISTER /home/hadoop/lib/mongo-java-driver-2.13.0.jar  
REGISTER…
         
    
    
        d0x
        
- 11,040
- 17
- 69
- 104
                    2
                    
            votes
                
                1 answer
            
        Apache Spark Mongo-Hadoop Connector class not found
So im trying to run this example https://github.com/plaa/mongo-spark/blob/master/src/main/scala/ScalaWordCount.scala
But i keep getting this error 
 Exception in thread "main" java.lang.NoClassDefFoundError: com/mongodb/hadoop/MongoInputFormat
at…
         
    
    
        user1290942
        
- 88
- 8
                    2
                    
            votes
                
                2 answers
            
        Hadoop with MongoDB Concept
Hi I am new to Hadoop and NoSQL technologies. I started learning with world-count program by reading file stored in HDFS and and processing it. Now I want to use Hadoop with MongoDB. Started program from here . 
Now here is confusion with me that it…
         
    
    
        Abhendra Singh
        
- 1,959
- 4
- 26
- 46
                    2
                    
            votes
                
                4 answers
            
        Update an existing collection in MongoDB using Java-Hadoop connector
Is it possible to update existing MongoDB collection with new data. I am using hadoop job to read write data to Mongo. Required scenario is :- 
Say first collection in Mongo is 
{ 
  "_id" : 1,
  "value" : "aaa"
  "value2" : null
}
after reading…
         
    
    
        Abhishek bhutra
        
- 1,400
- 1
- 11
- 29
                    1
                    
            vote
                
                0 answers
            
        mongo-hadoop java connector Iterate through all collections
I am trying to use this hadoop mongo connector,
https://github.com/mongodb/mongo-hadoop
I have seen many examples of connecting to a particular mongo collection using something like this, 
mongodbConfig.set("mongo.input.uri",…
         
    
    
        user3400864
        
- 17
- 1
- 5
                    1
                    
            vote
                
                1 answer
            
        Spark Mongo Hadoop Connector not mapping data
I am attempting to map data from mongodb-hadoop connector inside a spark application. I have not other errors prior to this one so im assuming that the connection to mongodb was successful. im using the following code to map:
JavaRDD logs =… 
         
    
    
        D.Asare
        
- 103
- 3
- 14
                    1
                    
            vote
                
                2 answers
            
        Hadoop with mongoDB : NoClassDefFoundError MongoConfigUtil
I'm learning how to write a map / reduce job in hadoop with mongodb data as input. So I followed this example,  but I got following error :
 Exception in thread "main" java.lang.NoClassDefFoundError: com/mongodb/hadoop/util/MongoConfigUtil
    at…
         
    
    
        Namsi Abdelkhalek
        
- 11
- 2
                    1
                    
            vote
                
                1 answer
            
        Hive Table Creation Using MongoDB Hadoop Driver
I am trying to connect from a Hive Database to a collection in MongoDB using a driver (jars) provided on the wiki site. Here are the steps I did: -
I created a collection in MongoDB called "Diamond" under a database called "Moe" and it has got 20…
         
    
    
        Mario
        
- 35
- 5
                    1
                    
            vote
                
                1 answer
            
        mongo-hadoop. not to handle mongodb document deletion
I want to synchronize mongodb and hadoop, but when I delete document from mongodb, this document must not be deleted in hadoop. 
I tried using mongo-hadoop and hive. this is hive query:
CREATE EXTERNAL TABLE SubComponentSubmission
(
  id STRING,
 …
         
    
    
        irakli2692
        
- 127
- 2
- 9
                    1
                    
            vote
                
                0 answers
            
        Getting error " Hadoop Release '%s' is an invalid/unsupported release. Valid entries are in 2.6.0"
I am working on mongodb-hadoop connector. For this process, i am building mongodb adapter,after edited build.sbt file,i am trying to building adapter like ./sbt package then i am getting error 
Hadoop Release '%s' is an invalid/unsupported…
         
    
    
        Prabjot Singh
        
- 4,491
- 8
- 31
- 51
                    1
                    
            vote
                
                0 answers
            
        MongoDB Hadoop error : no FileSystem for scheme:mongodb
I'm trying to get a basic Spark example running using mongoDB hadoop connector. I'm using Hadoop version 2.6.0. I'm using version 1.3.1 of mongo-hadoop. I'm not sure where exactly to place the jars for this Hadoop version. Here are the locations…
         
    
    
        Navin Viswanath
        
- 894
- 2
- 13
- 22
                    1
                    
            vote
                
                1 answer
            
        MongoDB Hadoop connector streaming not running
I want to launch the MongoDB Hadoop Streaming connector, so I downloaded a compatible version of Hadoop (the 2.2.0) (see https://github.com/mongodb/mongo-hadoop/blob/master/README.md#apache-hadoop-22)
I cloned the git repository mongohadoop, changed…
         
    
    
        Julien Fouilhé
        
- 2,583
- 3
- 30
- 56