Hadoop Solutions For Big Data

ARTSOFT and Cloudera Solutions


The Challenge

There is a tremendous amount of information driven by the ever changing applications, from structured, unstructured, to semi-structure data. Conventional IT infrastructure is not built to handle the variety, velocity and volume of the data produced by social media networks, mobile applications, machine sensors and scientific researches, etc. For Enterprises, utilizing big data analytics is no longer a question of when, it is a question of how. Hadoop, designed for the cost effective storage and processing of large volumes of data, is born for this purpose. It can linearly scale up to thousands of servers and petabytes of storage.

How to take advantage of Hadoop technology and gain competitive edge is on the mind of almost every corporate CIO. For enterprises, how to deploy the Hadoop infrastructure efficiently means winning or losing in the big battle of market share. Enterprises deploying Hadoop solutions often spend large amount of resource searching for the best architecture and the most capable solution provider. This is where Supermicro comes in to help.

The Solution

Introducing Supermicro Hadoop clusters, a series of optimized big data solutions that provide high performance, high reliability and high scalability. Supermicro Hadoop solutions are fully integrated, fully optimized and completely tested turnkey clusters with flexible support packages available to meet customer specific requirements.

ARTSOFT Hadoop clusters feature industry proven high density compute and storage servers populated with best of breed components selected through extensive engineering design, validation and testing. Certified configurations take the guess work out of designing and deploying a truly scalable Big Data compute and storage infrastructure that meets the most demanding enterprise IT and data center environments.

ARTSOFT Advantage

  • Designed from ground up with optimal server and cluster configurations that meet a variety of workloads
  • Proven solutions based on extensive lab testing and large scale production deployments
  • Achieve the best price/performance and the best price/capacity with industry leading server and storage platforms
  • End-to-end turnkey Hadoop clusters with completely integrated HW, SW and Global Support
  • Advanced architecture based on the latest technologies such as SkyLake CPUs, SAS3, NVMe, Optane Drives 
  • Enhanced networking performance and redundancy with dual 10GbE / 25GbE / 40GbE / 100GbE options
  • Automated full cluster testing guarantees build quality and delivery schedule

Fully Integrated Hadoop Cluster

Key Features and benefits:

  • Purpose built cluster configurations optimized for capacity, compute or IO performance
  • High availability Name Node design with no single point of failure
  • Large memory options designed specifically for Spark and other in memory, low latency computations
  • Hyper-Scale server platforms designed for extremely large deployments
  • High density compute, storage and memory design to achieve the best efficiency and lowest TCO
  • Flexible network switch options with 1 or 2x 10G / 25G / 100G switches per rack.
  • Cost effective 14U rack design, ideal for Proof of Concept testing environment
  • Standard 42U rack design and flexible PDU options that meet any data center environment
  • Up to Titanium Level (96%+) Efficiency - Redundant Power Supplies with PMBus
  • Built in with IPMI and SMC OOB (out of band management) suite for automated cluster management
  • Fully integrated, fully configured and completely tested with Hadoop distributions of your choice
  • Proof of Concept testing cluster available for risk free purchasing experience
Supermicro fully integrated Hadoop cluster solution rack
  • 1 or 2x 48 port 10G SFP+ / 10GBase-T / 25GbE
    1 or 2x 32 port 100GbE, 1x 48 port Switch, GbE
  • 1x Management Node 1U UP Skylake 41xx / 51xx
  • 3x Name Nodes 1U DP Xeon Skylake 41xx / 51xx / 61xx
  • Optimized Data nodes 2U SSG, 2U BigTwin or 4U FatTwin with Skylake 41xx / 51xx / 61xx / 81xx processors, dual 10G / 25G / 40G per node, 2.5" and 3.5" HDD options
  • Standard 42U rack with metered PDUs, rack customization options available
  • Integration service includes full cluster Burn-in and testing, BIOS and FW update, networking configuration, Pre-install Hadoop distribution of choice, and full cluster


Hadoop Cluster Technical Specifications

  High Capacity IO Optimized High Density Compute Balanced
Data Node SSG-6029P-E1CR24L SYS-2029BT-HNR SYS-F629P3-RC0B SSG-6019P-ACR12L
Data Node (Qty) 18 32 36 37
Form Factor 2U SuperStorage 2U BigTwin 4U FatTwin 1U SuperStorage
2x SKL 4114 2P 10C/20T 2.2G 85W 2x SKL 5118 4/2P 12C/24T 2.3G 105W 2x SKL 6130 4/2P 16C/32T 2.1G 125W 2x SKL 5118 4/2P 12C/24T 2.3G 105W
128GB 192GB 256GB 128GB
24 Bay 3.5" 6 Bay 2.5" 8 Bay 3.5" 12 Bay 3.5"
Total Data Drive 432 448 (NVMe)* 288 444
Total Cores 360 768 1152 888
Total Memory 2.3TB 6.144TB 9.2TB 4.7TB
Total Storage 3.45PB (8TB) 1.792PB (4TB) 1.15PB (4TB) 2.66PB (6TB)
Name Node 3x 1U WIO 3x 1U WIO 3x 1U WIO 3x 1U WIO
Switches 1x 48PT 25GBase-T 2x 32PT 40G 1x 48PT GbE 1x 48PT10G SFP+
Cabinet (W x H x D) 42U
23.5 x 82.4 x 48
PDU 2x 50A 208 3-Phase Metered PDU
*ADD 32 Bay 2.5" Drives x8 JBOF = 256 NVMe