Introduction big data

Category: Education

Presentation Description

Big data is a term that describes the large volume of data – both structured and unstructured – that inundates a business on a day-to-day basis. But it's not the amount of data that's important. ... Big data can be analyzed for insights that lead to better decisions and strategic business moves.


Presentation Transcript

slide 1:

Big Data Introduction By Professionalguru

slide 2:

 Introduction ◦ What is Big data ◦ Why Big-Data ◦ When Big-Data is really a problem  Techniques  Tools  Applications  Literature

slide 3:

 ‘Big-data’ is similar to ‘Small-data’ but bigger  …but having data bigger consequently requires different approaches: ◦ techniques tools architectures  …to solve: ◦ New problems… ◦ …and old problems in a better way.

slide 4:

From “Understanding Big Data” by IBM

slide 5:

slide 6:


slide 7:

 Key enablers for the growth of “Big Data” are: ◦ Increase of storage capacities ◦ Increase of processing power ◦ Availability of data

slide 9:

slide 10:

slide 11:

slide 12:

slide 13:

slide 14:

slide 15:

slide 16:

 NoSQL ◦ DatabasesMongoDB CouchDB Cassandra Redis BigTable Hbase Hypertable Voldemort Riak ZooKeeper  MapReduce ◦ Hadoop Hive Pig Cascading Cascalog mrjob Caffeine S4 MapR Acunu Flume Kafka Azkaban Oozie Greenplum  Storage ◦ S3 Hadoop Distributed File System  Servers ◦ EC2 Google App Engine Elastic Beanstalk Heroku  Processing ◦ R Yahoo Pipes Mechanical Turk Solr/Lucene ElasticSearch Datameer BigSheets Tinkerpop

slide 17:

slide 18:

 …when the operations on data are complex: ◦ …e.g. simple counting is not a complex problem ◦ Modeling and reasoning with data of different kinds can get extremely complex  Good news about big-data: ◦ Often because of vast amount of data modeling techniques can get simpler e.g. smart counting can replace complex model based analytics… ◦ …as long as we deal with the scale

slide 19:

 Research areas such as IR KDD ML NLP SemWeb … are sub- cubes within the data cube Scalability Dynamicity Context Quality Usage

authorStream Live Help