► Hadoop and Friends: Java/Python/Scala, Apache Hadoop, HDFS, Map Reduce, SQL, MySQL, Data Ware House, HBase, Pig, Hive, Oozie, Sqoop, Flume,Zookeeper, Junit/unittest, Git, PIP/Maven, Linux Commands and Shell Scripts
► Spark Data Pipeline: Spark, PySpark Spark-SQL, Spark-Streaming, Spark-ML, Machine Learning, Regressions, (Linear, Multi-Linear, Logistic), Clustering, K-Mean, KNN, NaiveBayes, Classification, Decision Trees, Random Forest
► Cloud Computing, AWS Hosting and Deployment, EC2, IAM, LSB, Load Balancer (LBS), Availability Group, Security Configuration, Docker, Kubernetes,
► New Gen Big Data Tools: Mahout, Recommendation, Storm, Flink, Samza,SAMOA, Apex, Beam, Tez: (as Per Batch conditions)