Hadoop Big Data quick summary: Hadoop is a Java-based programming framework that supports the processing of large data sets in a distributed computing environment. Its storage layer, HDFS, is modeled on the Google File System (GFS). Hadoop scales out across thousands of nodes, which is the key to its performance. Hadoop provides a distributed file system […]
Apache Cassandra: Apache Cassandra is an open source, freely distributed, high-performance, extremely scalable, and fault-tolerant post-relational database.
Apache Spark: Apache Spark is a powerful open source, general-purpose processing engine, similar in spirit to MapReduce, used for large-scale data processing.
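To make the "MapReduce-like" idea concrete, here is a minimal word-count sketch in plain Java. It uses only the standard library, not the Spark or Hadoop APIs, and it runs in a single process. The class name `WordCountSketch` and the method `wordCount` are hypothetical names chosen for illustration. The map step splits lines into words and the reduce step sums the count per word. Real engines distribute these same two steps across many nodes.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

// Illustrative only: a single-process imitation of the map and reduce phases.
public class WordCountSketch {

    static Map<String, Long> wordCount(List<String> lines) {
        return lines.stream()
                // Map phase: break each line into lowercase words.
                .flatMap(line -> Arrays.stream(line.toLowerCase().split("\\W+")))
                .filter(word -> !word.isEmpty())
                // Reduce phase: group identical words and count them.
                // A TreeMap keeps the result sorted by word.
                .collect(Collectors.groupingBy(word -> word, TreeMap::new, Collectors.counting()));
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList("big data big clusters", "data nodes");
        System.out.println(wordCount(lines)); // prints {big=2, clusters=1, data=2, nodes=1}
    }
}
```

In a distributed engine the grouping step implies a shuffle: all occurrences of the same word must reach the same node before they can be summed. In this sketch that step is hidden inside `groupingBy`.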
Apache Ambari: Apache Ambari is a fully open source operational framework for provisioning, managing, and monitoring Apache Hadoop clusters.
Apache POI Selenium WebDriver: Apache POI supports Selenium projects. Apache POI provides Java libraries for creating, reading, and writing Microsoft format files such as Word, PowerPoint, and Excel. Once the sheet data is imported into a Selenium WebDriver test, you can read from and write to Excel. Selenium scripts can be run from the Eclipse IDE. Apache POI gives Java […]