High Scalability of HDFS Using Distributed Namespace

dc.contributor.authorSingh, Harcharan Jit
dc.contributor.supervisorSingh, V. P.
dc.date.accessioned2012-07-25T08:15:25Z
dc.date.available2012-07-25T08:15:25Z
dc.date.issued2012-07-25T08:15:25Z
dc.descriptionM.E. (CSED)en
dc.description.abstractIn data intensive computing, Hadoop is widely used by organizations. The client applications of Hadoop require high availability and scalability of the system. Mostly, these applications are online and their data growth rate is unpredictable. The present Hadoop relies on secondary namenode for failover which slows down the performance of the system. Hadoop system’s scalability depends on the vertical scalability of namenode server. As the namespace of Hadoop distributed file system grows, it demands additional memory to cache. If namenode server does not have enough primary memory to cache the namespace, its performance and availability effects. A new Hadoop architecture has been proposed to address the issues of namenode scalability, single point of failure and Implementation security of Hadoop. This approach is based on distribution of namespace using distributed hash tables. The growing size of namespace of HDFS is divided into multiple name node servers. The proposed architecture of Hadoop is simulated by using the multiple name node servers. The name node are arranges in chord ring. This allows HDFS to scale up horizontally. By simply adding namenode to name node ring, the scalability, availability and performance of HDFS improves. The system provides decartelize managed approach for namespace distribution which gives consistent performance. The results of HDFS namespace to store 1 billion or above files are discussed in this research work. The proposed architecture has shown high availability and adapts to name node failure.en
dc.description.sponsorshipComputer Science and Engineering Department, Thapar University, Patialaen
dc.format.extent13974113 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10266/1786
dc.language.isoenen
dc.subjectHigh Scalabilityen
dc.titleHigh Scalability of HDFS Using Distributed Namespaceen
dc.typeThesisen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1786.pdf
Size:
5.02 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.79 KB
Format:
Item-specific license agreed upon to submission
Description: