Making Hadoop Secure for Enterprises – An Insight into the Imperative

The Big Data Infra for Enterprises – Making Hadoop Secure for Enterprises session was probably my 4th or 5th attendance to BigDataCloud meetup hosted by DJ and Jeeta Das. Time is luxury and if you login to meetup.com, several meetups with names that generate significant interest crop up. I am sure all agree with me. However, attending BigDataCloud meetup is time well invested. Few days ago I had spoken to J ...

Read more

OSCON Data | July 25-27, Portland, OR – BIGDATACLOUD MEMBERS SAVE 15%

New to the Open Source Conference this year is OSCON Data, for developers pioneering the evolving architectures and tools to manage data. See how Hadoop is used to optimize scalability and reliability at Yahoo. Find out how Facebook utilizes HBase to manage real-time messaging. Why Netflix moved from relational DBs to NoSQL cloud systems for personalized movie choosing. The in-depth sessions at OSCON Data, ...

Read more

Big Data and Hadoop Get Bigger, Amazon Web Services Slash Prices

Big data is getting bigger, and Hadoop is becoming the tool that will allow companies to make full use of their growing amounts of data, says Peter Fenton of Benchmark Capita. Fenton is a partner at the venture capital firm which owns the majority stake of HortonWorks, a startup that separated from Yahoo’s Hadoop dev team, and is offering services around the open-source big data analytics platform. In an in ...

Read more

Linux, Open Source & Ubuntu: Hadoop Data Analytics: 10 Reasons Why It’s Important for Business

Hadoop, the data analytics-for-huge-data-sets invention of Apache Chairman Doug Cutting that found its original home at Yahoo, made some big news this week at the fifth annual Hadoop Summit in Santa Clara, Calif. First, it was revealed that Hadoop officially—but not "spiritually"—will break away from Yahoo and be shepherded by a new VC-funded company called Hortonworks, named after the Dr. Seuss elephant ch ...

Read more

Informatica Adds Support for ‘big Data,’ Hadoop

Informatica is joining the growing ranks of vendors moving to support Hadoop, the open-source framework for large-scale or "big data" processing, the company announced Monday. The 9.1 version of Informatica's platform features a connector to the Hadoop file system (HDFS), allowing customers to move data in and out of Hadoop clusters. While the Hadoop project has its roots in Web companies, having been led b ...

Read more

Google, Yahoo, And Bing Collaborate On Structured Data To Make Search Listings Richer

At la 2006, today, Google, Microsoft, and Yahoo collectively announced that they will be partnering to create schema.org, a resource for site owners and developers to learn about structured data and gain insight into how to improve their sites’ search results. The site adds more than 100 new forms of website markup for content ranging from movies to places in an effort to standardize, and thus improve, how ...

Read more

Hadoop Orchestration

Most data processing tasks with Hadoop require multiple Hadoop jobs with dependencies between them. The dependency arises out of the need for one job to use the output for another job. The dependency between Hadoop jobs can be expressed as a directed acyclic graph (DAG), where each node represents a Hadoop job. DAG is useful for modelling relationship between entities that have partial ordering. The depende ...

Read more

What does the Cloud mean to me?

The Cloud means different things to different people. Everyone has their own views of the Cloud and myriad usage patterns have evolved, even while new and innovative options are emerging everyday. The standard definition of Cloud computing, as quoted in Wikipedia: So in essence, the Cloud is an environment where all aspects of computing are availed of as a service and consumers, be they individuals or busin ...

Read more

The cloud will finally solve the ‘big data’ problem

Innovation around the management of large data sets is coming from the cloud, such as through MapReduce and Hadoop InfoWorld's own Pete Babb provided some good coverage around the "analytics cloud" recently debuted by IBM, called Blue Insight. You can think of Blue Insight as a system that gathers data from those who use it and externalizes the data to those who need it, doing so on a cloud -- a private clo ...

Read more

The SMAQ stack for big data

Storage, MapReduce and Query are ushering in data-driven products and service "Big data" is data that becomes large enough that it cannot be processed using conventional methods. Creators of web search engines were among the first to confront this problem. Today, social networks, mobile phones, sensors and science contribute to petabytes of data created daily. To meet the challenge of processing such large ...

Read more

2013 © Big Data Cloud Inc. All Rights Reserved.

Hadoop and the Hadoop elephant logo are trademarks of the Apache Software Foundation.

Scroll to top