Introduction big data

Views:
 
Category: Education
     
 

Presentation Description

Big data is a term that describes the large volume of data – both structured and unstructured – that inundates a business on a day-to-day basis. But it's not the amount of data that's important. ... Big data can be analyzed for insights that lead to better decisions and strategic business moves.

Comments

Presentation Transcript

slide 1:

Big Data Introduction By Professionalguru

slide 2:

 Introduction ◦ What is Big data ◦ Why Big-Data ◦ When Big-Data is really a problem  Techniques  Tools  Applications  Literature http://professional-guru.com

slide 3:

 ‘Big-data’ is similar to ‘Small-data’ but bigger  …but having data bigger consequently requires different approaches: ◦ techniques tools architectures  …to solve: ◦ New problems… ◦ …and old problems in a better way. http://professional-guru.com

slide 4:

From “Understanding Big Data” by IBM http://professional-guru.com

slide 5:

http://professional-guru.com

slide 6:

Big-Data

slide 7:

 Key enablers for the growth of “Big Data” are: ◦ Increase of storage capacities ◦ Increase of processing power ◦ Availability of data http://professional-guru.com

slide 9:

http://professional-guru.com

slide 10:

http://professional-guru.com

slide 11:

http://professional-guru.com

slide 12:

http://professional-guru.com

slide 13:

http://professional-guru.com

slide 14:

http://professional-guru.com

slide 15:

http://professional-guru.com

slide 16:

 NoSQL ◦ DatabasesMongoDB CouchDB Cassandra Redis BigTable Hbase Hypertable Voldemort Riak ZooKeeper  MapReduce ◦ Hadoop Hive Pig Cascading Cascalog mrjob Caffeine S4 MapR Acunu Flume Kafka Azkaban Oozie Greenplum  Storage ◦ S3 Hadoop Distributed File System  Servers ◦ EC2 Google App Engine Elastic Beanstalk Heroku  Processing ◦ R Yahoo Pipes Mechanical Turk Solr/Lucene ElasticSearch Datameer BigSheets Tinkerpop http://professional-guru.com

slide 17:

http://professional-guru.com

slide 18:

 …when the operations on data are complex: ◦ …e.g. simple counting is not a complex problem ◦ Modeling and reasoning with data of different kinds can get extremely complex  Good news about big-data: ◦ Often because of vast amount of data modeling techniques can get simpler e.g. smart counting can replace complex model based analytics… ◦ …as long as we deal with the scale http://professional-guru.com

slide 19:

 Research areas such as IR KDD ML NLP SemWeb … are sub- cubes within the data cube Scalability Dynamicity Context Quality Usage http://professional-guru.com

authorStream Live Help