Big Data Analytics

Unit I
Data Management & Introduction to Big Data Tools (NOS 2101)
  • Design Data Architecture and manage the data for analysis,
  • Understand various sources of Data like Sensors/signal/GPS etc.
  • Export all the data onto Cloud ex. AWS/Rackspace etc.
  • Introduction to Big Data tools like Hadoop, Spark, Impala etc.,
  • Data ETL process, Identify gaps in the data and follow-up for decision making.

Unit II
Big Data Analytics & Machine Learning Algorithms (NOS 2101)

  • Run descriptive to understand the nature of the available data, collate all the data sources to suffice business requirement,
  • Run descriptive statistics for all the variables and observe the data ranges, Outlier detection and elimination.
  • Hypothesis testing and determining the multiple analytical methodologies,
  • Train Model on 2/3 sample data using various Statistical/Machine learning algorithms,
  • est model on 1/3 sample for prediction etc.

Big Data Analytics

  • Hadoop, HIVE, PIG, Scoop , Spark, Impala