<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Backup &#38; Beyond (mostly beyond) &#187; StoreGrid</title>
	<atom:link href="http://blogs.vembu.com/tag/storegrid/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.vembu.com</link>
	<description></description>
	<lastBuildDate>Wed, 21 Jul 2010 04:01:11 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Amazon gets calculative</title>
		<link>http://blogs.vembu.com/2009/11/amazon-gets-calculative/</link>
		<comments>http://blogs.vembu.com/2009/11/amazon-gets-calculative/#comments</comments>
		<pubDate>Wed, 25 Nov 2009 12:24:09 +0000</pubDate>
		<dc:creator>lux</dc:creator>
				<category><![CDATA[Amazon Cloud]]></category>
		<category><![CDATA[Online Backup]]></category>
		<category><![CDATA[SP Edition]]></category>
		<category><![CDATA[Vembu StoreGrid]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[AWS]]></category>
		<category><![CDATA[StoreGrid]]></category>
		<category><![CDATA[Vembu]]></category>

		<guid isPermaLink="false">http://blogs.vembu.com/?p=672</guid>
		<description><![CDATA[Amazon now have an AWS calculator available. The calculator, currently in Beta, allows you to get an idea of the kind of costs you can expect.]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblogs.vembu.com%2F2009%2F11%2Famazon-gets-calculative%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblogs.vembu.com%2F2009%2F11%2Famazon-gets-calculative%2F&amp;style=normal&amp;hashtags=amazon,AWS,Online+Backup,StoreGrid,Vembu" height="61" width="50" /><br />
			</a>
		</div>
<p>.</p>
<p>Two days ago,  Amazon announced Amazon Web Services (AWS) calculator on their <a href="http://aws.typepad.com/aws/2009/11/the-new-aws-simple-monthly-calculator.html">blog</a>. I can see this becoming quite useful, and am, in fact, surprised that they didn&#8217;t do this earlier.</p>
<p>You can access the AWS Simple Monthly Calculator <a href="http://aws.typepad.com/aws/2009/11/the-new-aws-simple-monthly-calculator.html">here</a>. The calculator, currently in Beta, allows you to get an idea of the kind of costs you can expect for different services that you might use. They&#8217;ve even had a stab at typical figures you might be seeing for some &#8216;typical use cases&#8217;.</p>
<p><img class="aligncenter size-medium wp-image-676" title="awscalc" src="http://blogs.vembu.com/wp-content/uploads/2009/11/awscalc-300x225.jpg" alt="awscalc" width="300" height="225" /></p>
<p><em>Don&#8217;t see the use cases? You need to scroll to the right if your resolution is 1024 x 768 (or lower)  &#8211; see the screenshot above.</em></p>
<p>The five use cases they have are (comments in brackets are my own):</p>
<ul>
<li>A marketing website (bandwidth intensive; requests from visitors)</li>
<li>A Web App (would use more of computing resources, including DB)</li>
<li>A Media application (storage &amp; bandwidth intensive)</li>
<li>HPC Cluster (CPU intensive)</li>
<li>Disaster Recovery &amp; Backup (storage intensive and computing to a lesser degree)</li>
</ul>
<p>Take the numbers with a pinch of salt (an ounce, in the case of DR &amp; Backup). But that&#8217;s not the point &#8211; I think it is pretty bold of them to hazard a guess knowing fully well that one size won&#8217;t fit all&#8230;</p>
<p>Rather than take the numbers at face value, you should focus on the AWS services in the &#8216;mix&#8217;, i.e. the different services that AWS assume you will use for each scenario. It serves to give you a good idea of the services you should be looking at.</p>
<p>We&#8217;ll take a more detailed look and provide some data that&#8217;s more relevant to StoreGrid + AWS deployments. Stay tuned.</p>
<p><em>The above post was written by Lakshmanan (Lux) Narayan of Vembu Technologies. Vembu Technologies is a backup software vendor whose product, StoreGrid, powers the <a href="http://www.vembu.com/">online backup</a> services of a large number of service providers across the globe. Besides <a href="http://www.vembu.com/storegrid/online-backup.php">remote backup</a>, StoreGrid is also used for on premise backups of workstations and servers at various companies &amp; universities.</em></p>
<p><em><a href="http://home.vembu.com/">Vembu Home</a> is the only <strong>FREE </strong>consumer backup solution for free local backups and optional Amazon Cloud backups. Get your <a href="http://home.vembu.com/">FREE COPY</a> now.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.vembu.com/2009/11/amazon-gets-calculative/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why Google will kill the “I’m feeling Lucky” button</title>
		<link>http://blogs.vembu.com/2009/07/why-google-will-kill-the-im-feeling-lucky-button/</link>
		<comments>http://blogs.vembu.com/2009/07/why-google-will-kill-the-im-feeling-lucky-button/#comments</comments>
		<pubDate>Wed, 29 Jul 2009 10:15:55 +0000</pubDate>
		<dc:creator>lux</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Online Backup]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[im feeling lucky]]></category>
		<category><![CDATA[StoreGrid]]></category>
		<category><![CDATA[Vembu]]></category>
		<category><![CDATA[wall street]]></category>

		<guid isPermaLink="false">http://blogs.vembu.com/?p=392</guid>
		<description><![CDATA[If Google removes the "I'm feeling lucky" button, it stands to gain $330mn in revenues &#038; $230mn in net income. This is too attractive an opportunity to ignore and under pressure from Wall Street, you can expect Google to remove the iconic button soon!]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblogs.vembu.com%2F2009%2F07%2Fwhy-google-will-kill-the-im-feeling-lucky-button%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblogs.vembu.com%2F2009%2F07%2Fwhy-google-will-kill-the-im-feeling-lucky-button%2F&amp;style=normal&amp;hashtags=google,im+feeling+lucky,Online+Backup,StoreGrid,Vembu,wall+street" height="61" width="50" /><br />
			</a>
		</div>
<p>Google is key to our business! We spend a lot of time studying how people looking for <a href="http://www.vembu.com" target="_blank">online backup solutions</a> found us – through the organic &amp; paid results on Google.</p>
<p>Besides this affinity for many things ‘Google’, three other events inspired this post:</p>
<p><a href="http://news.bbc.co.uk/2/hi/business/8154955.stm" target="_blank"><img class="alignright size-full wp-image-398" title="im feeling lucky google" src="http://blogs.vembu.com/wp-content/uploads/2009/07/googles.jpg" alt="im feeling lucky google" width="464" height="248" /></a></p>
<p>a) Google <a href="http://news.bbc.co.uk/2/hi/business/8154955.stm" target="_blank">announced </a>its quarterly results earlier this month. Though revenues were almost stagnant, Google managed to deliver a higher net income of $ 1.48 bn (Qtr ending June 30, 2009)</p>
<p>b) <a href="http://www.techcrunch.com/2009/07/24/are-bing-users-are-twice-as-likely-to-click-on-an-ad-than-google-users/" target="_blank">Techcrunch says</a> Bing users are twice as likely to click on ads than Google users.</p>
<p>c) I clicked on the ‘I’m feeling lucky’ button on Google today – more out of curiosity than out of habit! This got me thinking&#8230;</p>
<p><span style="color: #800000;"><strong>Who clicks the ‘I’m feeling Lucky’ button anyway?</strong></span></p>
<p>So, I googled it up (note the subtle irony here) and came across a very <a href="http://gawker.com/tech/google/im-feeling-lucky-button-costs-google-110-million-per-year-324927.php" target="_blank">interesting article</a> from Nov 2007 ; here’s an excerpt:</p>
<p><span style="color: #333333;"><em>Google cofounder Sergey Brin told public radio&#8217;s Marketplace that around one percent of all Google searches go through the &#8220;I&#8217;m Feeling Lucky&#8221; button. Because the button takes users directly to the top search result, Google doesn&#8217;t get to show search ads on one percent of all its searches. That costs the company around $110 million in annual revenue, according to Rapt&#8217;s Tom Chavez. So why does Google keep such a costly button around? &#8220;It&#8217;s possible to become too dry, too corporate, too much about making money. I think what&#8217;s delightful about &#8216;I&#8217;m Feeling Lucky&#8217; is that it reminds you there are real people here,&#8221; Google exec Marissa Mayer explained, or at least tried to.</em></span></p>
<p>So Google left $110 million on the table two years ago! Let’s try and recalculate the figure now&#8230;</p>
<p>a) Google’s annual revenue in 2008 was $22 billion – about twice its revenues in 2006. 66% of Google’s revenues are from Google owned sites. I’m simply assuming that Google.com (and other country sites) account for about 50%, and properties like Gmail and other Google sites account for the balance 16%. In effect, I’m assuming that search from Google homepage(s) contributes to $11bn of the total $22bn revenue.</p>
<p>b) Lets assume that the ‘1% of people click on I’m Feeling Lucky’ still holds. In fact, I suspect that this number would be higher considering the increasing number of newbie net users especially in developing countries! You’d expect the people who click on “I’m Feling Lucky” to be net newbies and unable to really differentiate between organic and paid results. Those well crafted &amp; well placed ads (like ours!) would attract their attention a lot more.</p>
<p>c) If the average Bing user is twice as likely as the average Google user to click on an ad, I’d argue that the average ‘I’m feeling lucky’ clicker, had he clicked on the money making ‘Google Search’ button instead, would be at least thrice as likely to click on an ad. Hence the ‘I’m feeling Lucky’ guys could account for (almost) 3% of ‘opportunity’ revenues</p>
<p>So, let’s do the math&#8230;</p>
<p>$11bn in annual revenues x a conservative 3% =</p>
<p><span style="color: #800000;"><strong>$ 330mn in increased revenues &#8211; simply by removing the ‘I’m feeling lucky’ button!</strong></span></p>
<p>What does this do for the bottom-line?</p>
<p>There is no direct cost (of revenue/goods sold) for this ad inventory – unlike the ad inventory from the content network where about 75% of the revenues are paid out to the content network partner. Hence this is $330mn of PURE Gross Profit! Google typically spends 30% of its revenues in R&amp;D &amp; G&amp;A expenses. Assuming this ratio applies for the incremental revenues, and taking that expense figure of 30% out ($100mn) , you’d expect them to add a whopping  $230mn annually to their net income &#8211; simply by removing the ‘I’m feeling lucky’ button!</p>
<p><strong><span style="color: #800000;">This $230mn translates to an increase (in net income) of over 5% </span></strong></p>
<p>I believe this is too tempting an opportunity for Google to pass up! When the going is good, its easy to say stuff like <span style="color: #333333;"><em>&#8220;It&#8217;s possible to become too dry, too corporate, too much about making money&#8230; &#8220;</em></span> . In the current economic climate, you can expect Google to milk every opportunity they have. After all, they still ‘answer’ to Wall Street!</p>
<p>Considering all this, the “I’m Feeling Lucky” opportunity is too much of a potential cash cow (or sitting duck, or low hanging fruit &#8211; pick your metaphor) to pass by. Remember, it translates into a direct increase of 5% in net income!</p>
<p>Of course, when it happens, Google won’t tell you the real reason! Expect to hear something like&#8230;</p>
<p><span style="color: #333333;"><em>“&#8230;extensive research conducted by Google showed that the absence of the I’m Feeling Lucky button translated to a user taking 6 milliseconds less &#8211; to reach the results page he was looking for. As much as the ‘I’m Feeling Lucky’ is a part of Google history, removing it makes for an enhanced user experience, and finally, Google is all about the user’s experience. So, with a heavy heart&#8230;” </em></span></p>
<p>As <a href="http://www.indiainfoline.com/" target="_blank">some people</a> say, <span style="color: #800000;"><strong>It’s all about the Money, Honey!</strong></span></p>
<p>Remember, you saw it here first!</p>
<p><em>The above post was written by Lakshmanan (Lux) Narayan of Vembu Technologies. Vembu Technologies is a backup software vendor whose product, StoreGrid, powers the <a href="http://www.vembu.com">online backup</a> services of a large number of service providers across the globe. Besides <a href="http://www.vembu.com/storegrid/online-backup.php">remote backup</a>, StoreGrid is also used for on premise backups of workstations and servers at various companies &amp; universities.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.vembu.com/2009/07/why-google-will-kill-the-im-feeling-lucky-button/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>Backup &#8211; Coming soon to a theatre near you</title>
		<link>http://blogs.vembu.com/2009/07/the-backup-plan/</link>
		<comments>http://blogs.vembu.com/2009/07/the-backup-plan/#comments</comments>
		<pubDate>Mon, 27 Jul 2009 05:37:49 +0000</pubDate>
		<dc:creator>lux</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Online Backup]]></category>
		<category><![CDATA[Remote Backup]]></category>
		<category><![CDATA[Vembu StoreGrid]]></category>
		<category><![CDATA[backup]]></category>
		<category><![CDATA[Jennifer Lopez]]></category>
		<category><![CDATA[JLo]]></category>
		<category><![CDATA[StoreGrid]]></category>
		<category><![CDATA[The Backup Plan]]></category>
		<category><![CDATA[Vembu]]></category>

		<guid isPermaLink="false">http://blogs.vembu.com/?p=380</guid>
		<description><![CDATA["The Backup Plan" is the name of a company run by Tim McGuire. The company offers Vembu StoreGrid powered Online Backup Services at http://thebackupplan.com . "The Backup Plan" is also the name of a movie that stars JLo &#038; is currently under production. As a result, Tim's getting more than he bargained for - tons of fans searching for the latest on their idol's latest movie!]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblogs.vembu.com%2F2009%2F07%2Fthe-backup-plan%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblogs.vembu.com%2F2009%2F07%2Fthe-backup-plan%2F&amp;style=normal&amp;hashtags=backup,Jennifer+Lopez,JLo,Online+Backup,StoreGrid,The+Backup+Plan,Vembu" height="61" width="50" /><br />
			</a>
		</div>
<div id="attachment_384" class="wp-caption alignleft" style="width: 262px"><img class="size-medium wp-image-384" title="jenniferlopez" src="http://blogs.vembu.com/wp-content/uploads/2009/07/jenniferlopez-252x300.jpg" alt="JLo picture from: Wikimedia Commons" width="252" height="300" /><p class="wp-caption-text">Credit : Wikimedia Commons</p></div>
<p>Ah, the price of fame!</p>
<p>Tim McGuire of  <a title="The Backup Plan" href="http://thebackupplan.com" target="_blank">&#8220;The Backup Plan&#8221;</a> has been offering his customers &#8216;Vembu powered&#8217; online backup services since January 2007. His company&#8217;s website is <a title="The Backup Plan" href="http://thebackupplan.com" target="_blank">http://thebackupplan.com</a></p>
<p>All&#8217;s been well, and Tim has been steadily signing up new customers and backing them up!</p>
<p>However, in recent times, in spite of more business, and a huge traffic surge, a smaller proportion of his website visitors are really keen on his backup services!What happened? Well, Hollywood happened&#8230;</p>
<p><a href="http://www.imdb.com/title/tt0799934/" target="_blank"></a></p>
<p>I have Google Alerts set up for various terms, and &#8216;backup&#8217; is obviously one of them. As a consequence, I get snippets on everything from generator backups to data backups. Last month, I noticed a couple of alerts for &#8216;The Backup Plan&#8217;. I clicked on the link expecting to see something about Tim&#8217;s firm, but was surprised to see JLo&#8217;s smiling face instead! Apparently, &#8220;The Backup Plan&#8221; is the title of a<a title="The Backup-Up Plan" href="http://www.imdb.com/title/tt1212436/" target="_blank"> </a><a href="http://www.imdb.com/title/tt1212436/" target="_blank">new movie starring JLo</a>. The movie is scheduled for a 2010 release but the spin doctors are already hard at work, and legions of crazed fans are typing thebackupplan into their browsers &amp; search engines &#8211; hoping to see more on Jlo!</p>
<p>Not one to disappoint, Tim has even created<a title="The Backup Plan Movie" href="http://thebackupplanmovie.com/" target="_blank"> </a><a href="http://thebackupplanmovie.com/" target="_blank">a link for fans</a> &#8211; that takes them to another site that focuses on the movie!</p>
<p>I&#8217;d emailed Tim last month asking him if he was being wooed by Hollywood, and this is what he had to say&#8230;</p>
<p><em>&#8220;&#8230;I am actually getting a brisk amount of traffic coming from people searching for J-Lo.  Unfortunately, not a lot of them are interested in backing up.  I did get a call from the movie people themselves, but they are mostly interested in seeing if I can link to their site, but they actually do not have one set up yet.  thebackupplanmovie.com is actually my site also; I started that when the traffic was spiking to my main site.  I wanted to try to divert the movie traffic to a site just for the movie, and leave my customers, prospects, and partners as the ones hitting my server.  So, on that site, I just started blogging about the movie and actors, and giving the fans of the actors a place to chat about the movie&#8230;.&#8221;</em></p>
<p>In fact, when I last checked, <a title="Ode to Backup Plan" href="http://thebackupplanmovie.com/2009/07/22/ode-to-a-back-up-plan/" target="_blank">Tim has even written an ode to The Backup Plan &amp; JLo</a>! He also <a title="Backup Plan Blog" href="http://thebackupplanmovie.com/2009/07/17/the-back-up-plan-official-site-is-alive/" target="_blank">blogged about the launch of the movie&#8217;s website</a> and called it &#8216;cute&#8217;. He almost seems to be enjoying his new role; it certainly looks like Tim&#8217;s crossing over to the dark side! You could help avoid this by <a title="The Backup Plan" href="http://thebackupplan.com" target="_blank">visiting his site</a>; it would be great for him to get more people looking for the <span style="text-decoration: underline;">real backup plan</span>!</p>
<p>It will be interesting to see how his traffic jumps when The Backup Plan (movie) releases in 2010! I&#8217;ll save that for a new post&#8230;next year!</p>
<p><em>The above post was written by Lakshmanan (Lux) Narayan of Vembu Technologies. Vembu Technologies is a backup software vendor whose product, StoreGrid, powers the <a href="http://www.vembu.com">online backup</a> services of a large number of service providers across the globe. Besides <a href="http://www.vembu.com/storegrid/online-backup.php">remote backup</a>, StoreGrid is also used for on premise backups of workstations and servers at various companies &amp; universities.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.vembu.com/2009/07/the-backup-plan/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Should you seed your backup? Ask Wolfram Alpha!</title>
		<link>http://blogs.vembu.com/2009/05/should-you-seed-your-backup-ask-wolfram-alpha/</link>
		<comments>http://blogs.vembu.com/2009/05/should-you-seed-your-backup-ask-wolfram-alpha/#comments</comments>
		<pubDate>Tue, 19 May 2009 07:50:40 +0000</pubDate>
		<dc:creator>lux</dc:creator>
				<category><![CDATA[Broadband]]></category>
		<category><![CDATA[General]]></category>
		<category><![CDATA[Online Backup]]></category>
		<category><![CDATA[Remote Backup]]></category>
		<category><![CDATA[Trends]]></category>
		<category><![CDATA[LtoR]]></category>
		<category><![CDATA[managed services]]></category>
		<category><![CDATA[StoreGrid]]></category>
		<category><![CDATA[Vembu]]></category>
		<category><![CDATA[Wolfram Alpha]]></category>

		<guid isPermaLink="false">http://blogs.vembu.com/?p=318</guid>
		<description><![CDATA[Should you seed your backup? Ask Wolfram Alpha!
Unless you've been living under a rock for the last few months or thought it was new version of Wolf, you've probably heard of the Wolfram Alpha engine and the (wrong) comparisons to Google. Here's one interesting application of the "computational knowledge engine"]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblogs.vembu.com%2F2009%2F05%2Fshould-you-seed-your-backup-ask-wolfram-alpha%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblogs.vembu.com%2F2009%2F05%2Fshould-you-seed-your-backup-ask-wolfram-alpha%2F&amp;style=normal&amp;hashtags=LtoR,managed+services,Online+Backup,StoreGrid,Vembu,Wolfram+Alpha" height="61" width="50" /><br />
			</a>
		</div>
<p>Unless you&#8217;ve been living under a rock for the last few months or mistook it for a new version of <a href="http://en.wikipedia.org/wiki/Wolfenstein_3D">Wolf</a>, you&#8217;ve probably heard of the <a href="http://www62.wolframalpha.com">Wolfram Alpha engine</a> and the (wrong) comparisons to Google.</p>
<p><script id="WolframAlphaScript" src="http://www.wolframalpha.com/input/embed/?type=large" type="text/javascript"></script></p>
<p>For starters, Wolfram Alpha is <strong>NOT </strong>a search engine &#8211; its (what they call) a <strong>&#8220;computational knowledge engine&#8221;</strong> &#8211; it generates output by doing computations from its own internal knowledge base, instead of searching the web and returning links.</p>
<p><strong>Slight detour to StoreGrid&#8230;</strong></p>
<p>StoreGrid has a feature called <a href="http://www.vembu.com/storegrid/sp-help/PageHelp/server-side-local-to-remote-server-migration.html">&#8220;Local to Remote Migration&#8221; <strong>(LtoR)</strong></a> wherein you can &#8217;seed&#8217; a backup server with the 1st full backup and can therefore start with doing only incremental backups from day 1 itself. You&#8217;d typically take advantage of this feature for &#8216;first full backups&#8217; that you think would take a long time.</p>
<p><strong>How much time will the first backup take? Ask Wolfram Alpha&#8230;</strong></p>
<p>For example, if you had 10GB and wanted to know how long it would take over a 1Mbps line (For asymmetric broadband plans, remember that you should need to look for the upload speed &#8211; NOT download), simply enter &#8220;<strong>data transfer time 10GB, 1Mb/s</strong>&#8221; &#8211; try it out using the box above.</p>
<p>See the answer, and if you don&#8217;t have the time, go the <a href="http://www.vembu.com/storegrid/sp-help/PageHelp/server-side-local-to-remote-server-migration.html">LtoR way</a>! Go ahead, give it a spin.</p>
<p><a href="http://www62.wolframalpha.com/examples/WebAndComputerSystems.html">Click here</a> for more examples of &#8220;Web &amp; Computer&#8221; stuff you can use Wolfram Alpha for. A larger list of possibilities is <a href="http://www62.wolframalpha.com/examples/">here</a> &#8211; and includes things like knowing <a href="http://www62.wolframalpha.com/input/?i=calories+in+1+bowl+of+corn+flakes+%2B+a+glass+of+OJ">how many calories you had for breakfast</a> and getting <a href="http://www62.wolframalpha.com/input/?i=International+Space+Station">information about satellites</a>.</p>
<p>What&#8217;s more, it even knows <a href="http://www62.wolframalpha.com/input/?i=the+meaning+of+life">the meaning of life</a>!</p>
<p>Have fun!</p>
<p><em>The above post was written by Lakshmanan (Lux) Narayan of Vembu Technologies. Vembu Technologies is a backup software vendor whose product, StoreGrid, powers the <a href="http://www.vembu.com">online backup</a> services of a large number of service providers across the globe. Besides <a href="http://www.vembu.com/storegrid/online-backup.php">remote backup</a>, StoreGrid is also used for on premise backups of workstations and servers at various companies &#038; universities.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.vembu.com/2009/05/should-you-seed-your-backup-ask-wolfram-alpha/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Will the real data please stand up? A look at deduplication in the online backup world</title>
		<link>http://blogs.vembu.com/2009/02/will-the-real-data-please-stand-up-a-look-at-deduplication-in-the-online-backup-world/</link>
		<comments>http://blogs.vembu.com/2009/02/will-the-real-data-please-stand-up-a-look-at-deduplication-in-the-online-backup-world/#comments</comments>
		<pubDate>Wed, 11 Feb 2009 10:50:44 +0000</pubDate>
		<dc:creator>Sekar Vembu</dc:creator>
				<category><![CDATA[Online Backup]]></category>
		<category><![CDATA[Remote Backup]]></category>
		<category><![CDATA[SP Edition]]></category>
		<category><![CDATA[Vembu StoreGrid]]></category>
		<category><![CDATA[Checksums]]></category>
		<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[Encryption]]></category>
		<category><![CDATA[StoreGrid]]></category>

		<guid isPermaLink="false">http://blogs.vembu.com/?p=191</guid>
		<description><![CDATA[
			
				
			
		
Talk about data deduplication (in the  backup and archiving domain) seems to be gaining a fair amount of momentum in  the last few years! Most enterprise backup software vendors like Symantec  (Veritas), EMC (Avamar) etc. support deduplication in some form or the other &#8211;  some do deduplication in the source system [...]]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: left; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblogs.vembu.com%2F2009%2F02%2Fwill-the-real-data-please-stand-up-a-look-at-deduplication-in-the-online-backup-world%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblogs.vembu.com%2F2009%2F02%2Fwill-the-real-data-please-stand-up-a-look-at-deduplication-in-the-online-backup-world%2F&amp;style=normal&amp;hashtags=Checksums,Deduplication,Encryption,Online+Backup,Remote+Backup,StoreGrid" height="61" width="50" /><br />
			</a>
		</div>
<p>Talk about <a href="http://en.wikipedia.org/wiki/Data_deduplication" target="_blank">data deduplication</a> (in the  backup and archiving domain) seems to be gaining a fair amount of momentum in  the last few years! Most enterprise backup software vendors like <a href="http://www.symantec.com/business/netbackup" target="_blank">Symantec  (Veritas)</a>, <a href="http://www.emc.com/products/detail/software/avamar.htm" target="_blank">EMC (Avamar)</a> etc. support deduplication in some form or the other &#8211;  some do deduplication in the source system (that is being backed up) and others  do deduplication at the target (backup/storage server). There are also pure  &#8220;deduplication based storage hardware vendors&#8221; like <a href="http://www.datadomain.com/" target="_blank">Data Domain</a> who have gained  considerable traction in the enterprise.<a href="http://blogs.vembu.com/wp-content/uploads/2009/02/data-deduplication.jpg"><img class="alignright size-full wp-image-213" title="Data Deduplication" src="http://blogs.vembu.com/wp-content/uploads/2009/02/data-deduplication.jpg" alt="" width="480" height="317" /></a></p>
<p>I am actually quite surprised by  the hype around deduplication and the adoption it seems to have gained in the  enterprise. The reason I am surprised is similar to the one I articulated in my  previous blog post: &#8220;<a href="http://blogs.vembu.com/2009/01/synthetic-full-backup-in-the-online-backup-world-are-we-inviting-trouble/" target="_blank">Synthetic Full Backup in the online backup world &#8211; Are we  inviting trouble?</a>&#8220;. The crux of my argument is that backup and archiving is  about building redundancy to the data and not about eliminating redundancy in  the name of efficiency of storage or network bandwidth. So it is my contention  that wherever feasible we should have as much redundancy to the data (that needs  backing up) and only under unavoidable circumstances should we resort to using  synthetic full backup or deduplication. Actually, let me state this more  strongly: <strong>&#8220;avoid falling for the synthetic full backup or deduplication  hype if you can!&#8221;</strong></p>
<p>But who am I to say this. I am neither an  “industry expert” nor am I Steve Jobs to say &#8220;this is what is good for you; take  it or leave it&#8221;. Given that we are a niche company trying to grow (and growing)  in the face of industry giants, we are actually contemplating building  deduplication support in our data backup software, StoreGrid. While not many of  our customers/partners are asking for it, we do get the occasional prospect  saying that deduplication (rather, the lack of it) is a show stopper feature for  them!</p>
<p>As we started thinking about and designing the best way to support  deduplication in StoreGrid, we encountered many options to consider and many  complexities to be handled. But at the end, <strong>we were left with a  fundamental question &#8211; whether a full-fledged deduplication is indeed possible  in the online backup world!</strong> Before I explain some of the options and  the complexities, and why we think a full-fledged de-duplication may not be  feasible in a pure online backup scenario, let me first get into a broad  overview of the two deduplication approaches&#8230;</p>
<p><strong>Deduplication at  the source (client) vs. at the target (backup server) : </strong>There are  vendors who claim they do the deduplication at the source (i.e. the client  system that is being backed up) as opposed to others who claim that they do  deduplication at the target (i.e. at the backup server). If deduplication is  done at the source then it is easy to deduplicate data at a block level across  all files within the source system. If deduplication is done at the target then  it is equally easy to deduplicate data at a block level across all files across  all the client systems backing up to the backup server. Quite obviously doing  deduplication across all files across all clients will be much more effective  than doing deduplication only at a client system level. It is theoretically  possible to do deduplication at the source system and still be able to  deduplicate across all systems backing up to the backup server. In this case,  each client (source) has to continuously update itself with the meta-data of the  blocks that are being stored in the backup server. The meta-data in this case  would simply be the checksums of the blocks. These checksums are looked up to  identify similar blocks of data. I have not personally tested such a product  myself &#8211; i.e. the ones doing deduplication at the source system and still being  able to deduplicate across all systems backing up to the backup server. But this  may not be as efficient in terms of performance as compared to doing the  deduplication at the backup server end, especially if the backup/storage server  resides at a remote data center (and the meta-data needs to be downloaded each  time from the remote server).</p>
<p>Armed with this background, lets dive  deeper into the implications of these &#8216;approaches&#8217; in the online backup  context&#8230;</p>
<p><span style="font-size: small;"><strong>Option 1: Deduplication at  target<br />
</strong></span>One of the most important requirements in the online  backup domain is that the data that is backed up is encrypted before the data  leaves the source system and is sent over the internet to the remote data center  (where the data is stored). Deduplication works by finding similar blocks across  all the files and physically storing only one copy of the block in the storage  system. And encryption works by destroying all patterns in a given data and  making the data random. Because of the way encryption eliminates all patterns,  trying to do deduplication on a set of encrypted files will have no effect &#8211;  i.e. finding similar blocks of data across encrypted data will not be of much  use as encryption would have eliminated all patterns. That means doing  deduplication at the remote storage end, where all the data from different  clients systems are encrypted and stored, is technically not possible. The  option of not encrypting the data that is being backed up to the remote data  center is not really an option in the online backup world.  Another point to note is that deduplication at target doesn&#8217;t really help much in  the case of an online backup scenario &#8211; clients still send all data across  and hence don&#8217;t save anything on bandwidth! Of course, you save on &#8217;server  side storage&#8217; but optimizing this, I&#8217;d assume, comes a distant second to  optimizing bandwidth utilization &#8211; for online backups!</p>
<p><span style="font-size: small;"><strong>Option 2: Deduplication at source &#8211; with a common encryption  key<br />
</strong></span>As I said before it is theoretically possible to do  deduplication at source and still be able to deduplicate across all client  systems in an organization. In order to do that, either the data should not be  encrypted during backup or all the client systems will have to use a common  encryption key to encrypt the data. Not encrypting the data is not really an  option with online backups. Using a common encryption key would mean that for  each block of data that is backed up the checksum signature of the unencrypted  block is also sent to the backup server where it is stored. Every client that is  backed up should look up this database of checksums stored in the backup server  before sending a block of data to the backup server. Though this can be done  efficiently, I am not really fond of this option, because of the performance  penalty, considering that the backup server is at a remote location in the case  of online backups.</p>
<p><strong><span style="font-size: small;">Option 3: Deduplication at local  target backup server &#8211; with offsite replication<br />
</span></strong>The only  practical option I can think of is to have a deployment model where all clients  in an organization backup to a local backup server &#8211; without encryption. The  backed up data is deduplicated at the local backup server and then encrypted and  sent to a remote backup or replication server. This deployment model will ensure  that the deduplication is done on data from across all clients backing up to the  local backup server.  Depending upon a customer&#8217;s preference, the local backup  server can either keep a copy of the deduplicated backed up data (for quicker  restores) or the backed up data at the local backup server can be purged (not  recommended) once the data is moved to the remote backup/replication  server.</p>
<p>In summary, we prefer the last approach, viz. doing the  deduplication at the target backup server which is deployed locally at the site  where clients systems are. This would allow the client to backup to the local  backup server without encrypting the data &#8211; thus facilitating  deduplication at the target. And for offsite storage, the data  from the local backup server would be deduplicated, encrypted and sent to the  remote backup or replication server.  This would also ensure that the benefits of bandwidth savings associated with deduplication are also achieved.</p>
<p>I look forward to feedback &amp;  suggestions on other &#8216;better&#8217; ways of implementing deduplication in the online  backup domain!</p>
<p><em>The above post was written by Sekar Vembu of  Vembu Technologies. Vembu Technologies is a backup software vendor whose  product, StoreGrid, powers the <a href="http://www.vembu.com" target="_blank">online backup</a> services of a large  number of service providers across the globe. Besides <a href="http://www.vembu.com/storegrid/online-backup.html" target="_blank">remote backup</a>, StoreGrid is also used for on premise backups of workstations and servers at  various companies &amp; universities.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.vembu.com/2009/02/will-the-real-data-please-stand-up-a-look-at-deduplication-in-the-online-backup-world/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
	</channel>
</rss>
