Welcome!

Microsoft Cloud Authors: Pat Romanski, Elizabeth White, Liz McMillan, Mihai Corbuleac, David Bermingham

Related Topics: Containers Expo Blog, Java IoT, Microservices Expo, Agile Computing, @CloudExpo, Apache

Containers Expo Blog: Blog Feed Post

Object Storage Not Yet Defined

Agreed that object storage platforms scale better than file systems & NAS

The ExecEvent Object Storage Summit earlier this month continued to generate buzz on the industry, which is very exciting. Amplidata was represented – in spirit – at the Summit by our partners Intel and Quantum; due to an insane travel and show schedule this fall that kept us from attending personally.  We’re grateful for the mention in Storage Switzerland’s sponsor briefing articles. Very cool! With all the great stuff that has been happening for Amplidata lately, including the awesome performance test results by Howard Marks, we felt a bit like we were missing our own birthday party. We’ll be there next time!

The event fostered a few “What is Object Storage?” posts from, amongst others, George Crump. Jim O’Reilly also posted a very interesting article, although I’m not sure if he was at the event. If he wasn’t, he should be next time!

Both articles add to the body of knowledge that is rapidly evolving on what object storage is, and why customers should adopt it – so, every article helps. With a topic as technical as object storage, it’s easy to evangelize with a deep technical dive.  But that misses the “elegant simplicity” point.  Hence we love George’s use of the car park analogy which we ourselves often embrace.  His article was a helpful at-a-glance overview.  On a more technical level, Jim’s explanation of such concepts as immutable blobs, “the original version is the only version”, objects still look like files etc. offer more on how object storage really works. George’s analysis on how “Objects are given unique ID numbers” is what’s missing in Jim’s article. I guess, what we’re saying is “read both articles.”

But read them critically, and you will see that we’re not there yet. As you can read in Jim’s article, the paradigm has been around much longer than many of us know and we’re not complete in defining the best use cases, implementations, architectures, etc. For example, I’m not at all sure about the reduced metadata George writes about. I believe that over time, as we start using richer applications, we will be storing more metadata, not less. To me, Jim’s statement “To be an object, a blob of data needs a much more detailed descriptor record than what file systems use.” is more accurate.

Both articles also cover the “why” of Object Storage. I’m not sure I see the use of Jim’s deduplication paragraph, and I think we are missing erasure coding as an alternative to RAID in his article (replication can be expensive too!). Jim accurately mentions that block storage was I/O focused, but omits the exceptional throughput performance some of the object stores deliver. A good thing is that Jim sees the scalability, flexibility and cost-saving opportunities. Finally, I very much like his use cases: Google Picasa, Amazon S3, Genome etc. and it is very interesting to read that Jim sees potential for object storage in the Big Data analytics space.

So back to George’s take on why we need object storage. Agreed that object storage platforms scale better than file systems & NAS but, again, not so much because of the metadata. File systems have different challenges, such as the granularity of the hardware, limitations on numbers of files or the number of levels in the hierarchy. Distributed file systems tried to solve some of these issues, but object storage is just a much simpler approach. Agreed that adding NAS heads is an expensive and not so great solution!

The second topic I thought was interesting was the issue of “bit rot”. Bit rot is a real problem and will lead to data loss with traditional storage technologies, but not every object store will solve that. How I understood it is that it is the underlying data protection scheme that solves the problem of bit rot, not necessarily Object Storage. Erasure Coding detects bit rot and prevents data loss.  I don’t think you could restore the content of an object using the identifier, but maybe there is some really cool technology out there that I don’t know of. As George wrote “The storage system does not need an elaborate RAID protection algorithm nor do its administrators need to suffer through long RAID rebuild cycles”, I think he actually alludes to Erasure Coding but didn’t want to go that deep in this article.

Another interesting point in George’s article is the issue with backups. Once you go into the petabyte range, it becomes very unwieldy to backup data. He mentions the backup window, but add to that the overhead cost. George promotes using the unique IDs to make sure “that there are always copies of each object available on-site and off-site.” Again with the proper underlying protection schemes (erasure coding) you can rule out backups altogether!

I’m sure both George and Jim will appreciate the feedback – I fully agree with the benefits object storage brings to track iterations of files and the paragraph on geo dispersion, which we have termed geo-spreading. Finally, I hope to read some more of George’s thoughts about how object storage can help to monetize archived data as that, to me, is a key argument for this new but then again not so new storage paradigm. This is obviously not the end of the discussion; a lot will and needs to be said about this new paradigm. I’m looking forward to attending the next Object Storage events…

Read the original blog entry...

More Stories By Tom Leyden

Tom Leyden is VP Product Marketing at Scality. Scality was founded in 2009 by a team of entrepreneurs and technologists. The idea wasn’t storage, per se. When the Scality team talked to the initial base of potential customers, the customers wanted a system that could “route” data to and from individual users in the most scalable, efficient way possible. And so began a non-traditional approach to building a storage system that no one had imagined before. No one thought an object store could have enough performance for all the files and attachments of millions of users. No one thought a system could remain up and running through software upgrades, hardware failures, capacity expansions, and even multiple hardware generations coexisting. And no one believed you could do all this and scale to petabytes of content and billions of objects in pure software.

@ThingsExpo Stories
Cloud computing is being adopted in one form or another by 94% of enterprises today. Tens of billions of new devices are being connected to The Internet of Things. And Big Data is driving this bus. An exponential increase is expected in the amount of information being processed, managed, analyzed, and acted upon by enterprise IT. This amazing is not part of some distant future - it is happening today. One report shows a 650% increase in enterprise data by 2020. Other estimates are even higher....
SYS-CON Events announced today that Bsquare has been named “Silver Sponsor” of SYS-CON's @ThingsExpo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. For more than two decades, Bsquare has helped its customers extract business value from a broad array of physical assets by making them intelligent, connecting them, and using the data they generate to optimize business processes.
A strange thing is happening along the way to the Internet of Things, namely far too many devices to work with and manage. It has become clear that we'll need much higher efficiency user experiences that can allow us to more easily and scalably work with the thousands of devices that will soon be in each of our lives. Enter the conversational interface revolution, combining bots we can literally talk with, gesture to, and even direct with our thoughts, with embedded artificial intelligence, wh...
Internet of @ThingsExpo, taking place November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 19th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal and enterprise IT since the creation of the Worldwide Web more than 20 years ago. All major researchers estimate there will be tens of billions devices - comp...
19th Cloud Expo, taking place November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Meanwhile, 94% of enterpri...
Connected devices and the industrial internet are growing exponentially every year with Cisco expecting 50 billion devices to be in operation by 2020. In this period of growth, location-based insights are becoming invaluable to many businesses as they adopt new connected technologies. Knowing when and where these devices connect from is critical for a number of scenarios in supply chain management, disaster management, emergency response, M2M, location marketing and more. In his session at @Th...
It is one thing to build single industrial IoT applications, but what will it take to build the Smart Cities and truly society changing applications of the future? The technology won’t be the problem, it will be the number of parties that need to work together and be aligned in their motivation to succeed. In his Day 2 Keynote at @ThingsExpo, Henrik Kenani Dahlgren, Portfolio Marketing Manager at Ericsson, discussed how to plan to cooperate, partner, and form lasting all-star teams to change t...
The cloud market growth today is largely in public clouds. While there is a lot of spend in IT departments in virtualization, these aren’t yet translating into a true “cloud” experience within the enterprise. What is stopping the growth of the “private cloud” market? In his general session at 18th Cloud Expo, Nara Rajagopalan, CEO of Accelerite, explored the challenges in deploying, managing, and getting adoption for a private cloud within an enterprise. What are the key differences between wh...
The 19th International Cloud Expo has announced that its Call for Papers is open. Cloud Expo, to be held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, brings together Cloud Computing, Big Data, Internet of Things, DevOps, Digital Transformation, Microservices and WebRTC to one location. With cloud computing driving a higher percentage of enterprise IT budgets every year, it becomes increasingly important to plant your flag in this fast-expanding business opportuni...
Internet of @ThingsExpo, taking place November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with the 19th International Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world and ThingsExpo Silicon Valley Call for Papers is now open.
There is little doubt that Big Data solutions will have an increasing role in the Enterprise IT mainstream over time. Big Data at Cloud Expo - to be held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA - has announced its Call for Papers is open. Cloud computing is being adopted in one form or another by 94% of enterprises today. Tens of billions of new devices are being connected to The Internet of Things. And Big Data is driving this bus. An exponential increase is...
In his general session at 18th Cloud Expo, Lee Atchison, Principal Cloud Architect and Advocate at New Relic, discussed cloud as a ‘better data center’ and how it adds new capacity (faster) and improves application availability (redundancy). The cloud is a ‘Dynamic Tool for Dynamic Apps’ and resource allocation is an integral part of your application architecture, so use only the resources you need and allocate /de-allocate resources on the fly.
In his keynote at 18th Cloud Expo, Andrew Keys, Co-Founder of ConsenSys Enterprise, provided an overview of the evolution of the Internet and the Database and the future of their combination – the Blockchain. Andrew Keys is Co-Founder of ConsenSys Enterprise. He comes to ConsenSys Enterprise with capital markets, technology and entrepreneurial experience. Previously, he worked for UBS investment bank in equities analysis. Later, he was responsible for the creation and distribution of life sett...
Machine Learning helps make complex systems more efficient. By applying advanced Machine Learning techniques such as Cognitive Fingerprinting, wind project operators can utilize these tools to learn from collected data, detect regular patterns, and optimize their own operations. In his session at 18th Cloud Expo, Stuart Gillen, Director of Business Development at SparkCognition, discussed how research has demonstrated the value of Machine Learning in delivering next generation analytics to imp...
There are several IoTs: the Industrial Internet, Consumer Wearables, Wearables and Healthcare, Supply Chains, and the movement toward Smart Grids, Cities, Regions, and Nations. There are competing communications standards every step of the way, a bewildering array of sensors and devices, and an entire world of competing data analytics platforms. To some this appears to be chaos. In this power panel at @ThingsExpo, moderated by Conference Chair Roger Strukhoff, Bradley Holt, Developer Advocate a...
Cognitive Computing is becoming the foundation for a new generation of solutions that have the potential to transform business. Unlike traditional approaches to building solutions, a cognitive computing approach allows the data to help determine the way applications are designed. This contrasts with conventional software development that begins with defining logic based on the current way a business operates. In her session at 18th Cloud Expo, Judith S. Hurwitz, President and CEO of Hurwitz & ...
SYS-CON Events announced today that ReadyTalk, a leading provider of online conferencing and webinar services, has been named Vendor Presentation Sponsor at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. ReadyTalk delivers audio and web conferencing services that inspire collaboration and enable the Future of Work for today’s increasingly digital and mobile workforce. By combining intuitive, innovative tec...
Amazon has gradually rolled out parts of its IoT offerings, but these are just the tip of the iceberg. In addition to optimizing their backend AWS offerings, Amazon is laying the ground work to be a major force in IoT - especially in the connected home and office. In his session at @ThingsExpo, Chris Kocher, founder and managing director of Grey Heron, explained how Amazon is extending its reach to become a major force in IoT by building on its dominant cloud IoT platform, its Dash Button strat...
industrial company for a multi-year contract initially valued at over $4.0 million. In addition to DataV software, Bsquare will also provide comprehensive systems integration, support and maintenance services. DataV leverages advanced data analytics, predictive reasoning, data-driven diagnostics, and automated orchestration of remediation actions in order to improve asset uptime while reducing service and warranty costs.
Vidyo, Inc., has joined the Alliance for Open Media. The Alliance for Open Media is a non-profit organization working to define and develop media technologies that address the need for an open standard for video compression and delivery over the web. As a member of the Alliance, Vidyo will collaborate with industry leaders in pursuit of an open and royalty-free AOMedia Video codec, AV1. Vidyo’s contributions to the organization will bring to bear its long history of expertise in codec technolo...