|
|
MapR is fully compatible with Apache Hadoop. See discussion to question 'Is MapR wire compatible or API compatible with Hadoop 0.20.2?' which should address your first question. The NFS mount is an extra feature that will give you a convenient way to get your data in and out of the filesystem and a way to view and manipulate your data (files) via standard filesystem utilities. If using the Hadoop API method is working for you I think you should continue to do so. As for your question on performance, see discussion on question 'Which is more performant: using MapR over NFS or via HDFS?' which addresses the same thing. |
|
Regarding installation.. Yes, you'll need to run configure.sh on each node. The documentation for installation is located here. If you have keyless ssh setup between your cluster nodes, you can wrap the installation steps in a script and execute it from a single node. |
|
Regarding this specific question:
What I typically do is leverage passwordless SSH and a utility such as "parallel-ssh" (or pssh for short). This makes it relatively easy and painless to install, configure and start MapR. Here is some example output from one of my lab machines:
If you use the pssh utility as shown above then you can install/setup/start the MapR software as follows:
Thanks Arvind, Santosh and Aaron for your timely replies
(13 Jul '11, 02:11)
ghousia
|