Sunday 27 August 2017

About Big Data

Before to start exploration on Big data, let's think about how the big data comes, in picture??

The below are the reasons behind the big data comes in picture:
  1. Evolution of technology
  2. IOT(Internet Of Things)
  3. Social Media
  4. Other factors
Let's see briefly:
 
1)Evolution of technology:


Earlier we had land line phones, But nowadays,we have android,IOS smart phones, to make our life smarter. so just think, for each  operation which we perform on smart phones, generates a data, that resides somewhere

Desktops are the source to handle operations, i mean to store and process using storage devices like floppy,discs,taps,..etc.,

But in these days, Hard disks,cloud storage plays a vital role.

Earlier , we are in the hand of Analog storage, but these days almost of Digital storage. and also about the evolution of car, self driving car,




2)IOT(Internet Of Things):

IOT connects physical device to Internet and makes device smarter.

Example:
Smart TV's, Smart Ac's, Smart Car's etc.,



3)Social Media:
Data generation on social media sites,
  • Facebook likes,videos,photos,tags,comments etc.,
  • Tweeter tweets,
  • Youtube video uploads
  • Instagram pics,
  • Emails

4)Other Factors:

  • Retail
  • Banking & Finance,
  • Media & Entertainment
  • Health care,
  • Education areas,
  • Government,
  • Transportation, Insurance etc.,

Note: Assumption by 2020, 50 billions IOT devices will in the world.

Big Data:

Big data is a term for data sets that are so large or complex or even huge or massive volume of both structured and unstructured data that traditional data processing application software is inadequate to deal with big data or difficult to process.

Note: Big Data is not a technology, it's paradigm(pattern) shift.


To determine which data is considered as Big Data,  we have some
Characteristics of big data:(5V's of Big Data):


1)V- Volume:
Amount of data being generating and generated.



2)V- Variety:
Different kinds of data , that is being generated from various sources.



Types of data:
  1. Structured data - Tables
  2. Semi-structured data - CSV,JSON,EMAILS,TSV,XML
  3. Unstructured data - Videos, images, Logs, Audio files
3)V- Velocity:
The speed at which the data is being generated and processed to meet the demands.
Data is being generated at alarming rate.


4)V- Value:
Mechanism to bring the correct meaning out of huge data.

5)V- Veracity:
Uncertainty and inconsistencies in the data, i.e., The quality of captured data can vary greatly, affecting accurate analysis.


Problems with Big Data:
Problem 1:Storing exponentially growing large data sets in a non-distributed system.

Problem 2:Processing variety of data i.e., complex structure data.

Problem 3:Processing data faster

To put a solution for those above problems , Hadoop comes and plays a vital role.

Solutions with Hadoop:

Problem 1:Storing exponentially growing large data sets in a non-distributed system.

Solution: HDFS
  • It is storage part of Hadoop
  • Distributed File system,
  • Divides files into smaller chunks and stores across the cluster.
  • Scalable as per requirement(Scalability)

Problem 2:Storing varies of data.

Solution: HDFS
  • HDFS allows to store any kind of data,(Structured,semi-structured or unstructured)
  • No schema validation in HDFS while dumping data
  • Follows WORM (Write once Ream Many)


Problem 3:
Processing data faster

Solution: MapReduce
  • Parallel execution of data present in HDFS
  • Allows to process the data locally, i.e., each node responsible for data processing which stored on it.
Big data as an opportunity to bring below:



Big data use cases:
Below are some of the Big data use cases from different domains:
  •  Improve Customer Experience
  •  Sentiment analysis
  •  Customer Churn analysis
  •  Predictive analysis
  •  Real-time ad matching and serving
                                                                                                                                                  Next_Page

16 comments:

  1. Advance Hadoop training in Delhi every day technology changes and we are learning more technical things to support ourself in the computative world,so our Aptron Solutions provides more knowledge with real time projects and experienced staff, they have 10 to 20 years experience in training and numerous students are placed in top and best corporate companies.We give both online training and classroom training with student flexibility.
    For More Info: Hadoop Course in Delhi

    ReplyDelete
  2. https://bigdata.openkbs.org/2017/04/big-data-analytics-using-tda-as-first.html?showComment=1590262238730#c5149126500545273178

    ReplyDelete
  3. Positive site, where did u come up with the information on this posting? I'm pleased I discovered it though, ill be checking back soon to find out what additional posts you include. Data Blending in Tableau

    ReplyDelete
  4. I feel very grateful that I read this. It is very helpful and very informative and I really learned a lot from it.
    ai training in mysore

    ReplyDelete
  5. I am really enjoying reading your well written articles. It looks like you spend a lot of effort and time on your blog. I have bookmarked it and I am looking forward to reading new articles. Keep up the good work.
    ai training in varanasi

    ReplyDelete
  6. Excellent Blog! I would like to thank for the efforts you have made in writing this post. I am hoping the same best work from you in the future as well. Thanks for sharing. Great websites! I would like to add a little comment data science training in hyderabad

    ReplyDelete
  7. Great post i must say and thanks for the information. Education is definitely a sticky subject. However, is still among the leading topics of our time. I appreciate your post and look forward to more.
    data science institute in hyderabad
    data science course hyderabad
    data analytics courses in hyderabad

    ReplyDelete
  8. Thanks for sharing the post.. parents are worlds best person in each lives of individual..they need or must succeed to sustain needs of the family.
    Ciencia de Datos México

    ReplyDelete
  9. I just found this blog and have high hopes for it to continue. Keep up the great work, its hard to find good ones. I have added to my favorites. Thank You. big data training london

    ReplyDelete
  10. Creative Web Studio - The Cyber Defense Company bietet als zertifiziertes Unternehmen lösungsorientierte und zeitgemässe ICT-Services für KMUs an Hauptfokus: Cloud, IT-Security und Informatik.Forensic

    ReplyDelete
  11. Very informative post! There is a lot of information here that can help any business get started with a successful social networking campaign. big data analytics

    ReplyDelete
  12. Great blog, amazed with the subject you have developed the content. These kind of posts really helpful to gain the knowledge of unknown things
    AWS Training in Hyderabad

    ReplyDelete
  13. I found so many interesting stuff in your blog especially its discussion. From the tons of comments on your articles, I guess I am not the only one having all the enjoyment here! keep up the good work... https://besttapestorage.weebly.com/blog/tape-stockpiling-in-singapore-what-are-your-choices

    ReplyDelete
  14. Nice Piece Of Information, Keep Sharing Such Informative Post.

    big data hadoop course

    Call on 7070905090 To Join Ducat Today

    ReplyDelete

Fundamentals of Python programming

Fundamentals of Python programming: Following below are the fundamental constructs of Python programming: Python Data types Python...