Question-7: HDFS supports HDFS Transparency, what are the advantages of using HDFS Transparency?

  1. Hadoop applications can run unmodified over other Supported Object Storage.
  2. It provides, Immediate support for Hadoop applications and ISVs
  3. Reuse HDFS client as-is

Answer:

Exp: CDP Private Cloud Base uses IBM Spectrum Scale custom service descriptor (CSD) to integrate with IBM Spectrum Scale. The CSD is a file that describes a product for use with Cloudera Manager. Cloudera Manager can then support configuration, distribution, and monitoring of that product. 

Starting from HDFS Transparency and IBM Spectrum Scale, HDFS Transparency is integrated with the IBM Spectrum Scale installation toolkit and Cluster Export Services (CES). The integration of CES with HDFS Transparency as shown below is called IBM Spectrum Scale CES HDFS Transparency, or CES HDFS.

IBM Spectrum Scale Cluster Export Services (CES) provides different protocol services, such as Network File System (NFS), Object, Hadoop Distributed File System (HDFS), or Server message Block (SMB) to an IBM Spectrum Scale cluster. 

The installation toolkit can install HDFS Transparency as part of the CES protocol stack. The CES interface can now control and configure HDFS Transparency by using the same interfaces as with the other protocols. The CES protocol manages the HDFS Transparency NameNodes only. The HDFS Transparency DataNodes are not part of the CES protocol nodes.

IBM Spectrum Scale HDFS Transparency implementation integrates the NameNodes and the DataNodes services. It responds to the request as though it were HDFS on IBM Spectrum Scale file system. As show below 


Other Popular Courses