<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Musicmetric &#187; time series</title>
	<atom:link href="http://www.musicmetric.com/tag/time-series/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.musicmetric.com</link>
	<description>Sexy Data</description>
	<lastBuildDate>Thu, 02 Feb 2012 14:21:32 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Analysing trends over time with musicmetric</title>
		<link>http://www.musicmetric.com/2009/12/analysing-trends-over-time-with-musicmetric/</link>
		<comments>http://www.musicmetric.com/2009/12/analysing-trends-over-time-with-musicmetric/#comments</comments>
		<pubDate>Sun, 13 Dec 2009 15:13:49 +0000</pubDate>
		<dc:creator>Greg</dc:creator>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Labs]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[music analytics]]></category>
		<category><![CDATA[time series]]></category>
		<category><![CDATA[trend analysis]]></category>

		<guid isPermaLink="false">http://wptest.musicmetric.com/?p=269281521</guid>
		<description><![CDATA[In this blog post we’re going to look at an example of some of the data mining and large scale analysis which we do at musicmetric, detecting patterns and similarities in time series data.]]></description>
			<content:encoded><![CDATA[<p>In this blog post we’re going to look at an example of some of the data mining and large scale analysis which we do at musicmetric, detecting patterns and similarities in time series data.</p>
<p>One use of this analysis is that given an artist, we can find another artist with the closest trend in some variable over time – for example MySpace plays per hour. Alternatively we could generate a list of artists who are increasing in popularity in a certain way, or show which artists have had a brief surge in activity – maybe caused an album release or gig.</p>
<p>Because we store all the data indefinitely and in such a way that we can access it very rapidly, we can run regular batch analysis on the contents of our data warehouse to unlock interesting information.</p>
<p>In this example, we will compare the play count time series data for the top 20,000 artists by total plays on MySpace. It is important to consider that some trends may follow each other with a time lag, so we compare the 20K time series at multiple time lags from 0 to 30 days in the past, in 1 day increments. This means the approximate number of time series comparisons our analysis servers must do for this particular problem is 6 Billion, each one comparing hourly resolution data over a period of 4 months.</p>
<p>Let’s take a look at which artist has a similar trend to Kings of Leon:</p>
<div class="wp-caption alignnone" style="width: 571px"><a href="/wp-content/uploads/2009/12/kol_thefray3.jpg"><img title="Kings of Leon and The Fray - Plays per Hour" src="/wp-content/uploads/2009/12/kol_thefray3.jpg" alt="Kings of Leon and The Fray - MySpace Plays Per Hour" width="561" height="421" /></a><p class="wp-caption-text">Kings of Leon and The Fray - MySpace Plays Per Hour</p></div>
<p>We can see the plays per hour for The Fray seem to be following a similar long term trend to that of Kings of Leon, but offset by the difference in their popularity on MySpace – although they are converging as time goes on. The peaks and troughs also line up, so clearly the fine resolution hourly variation in the data has something to do with the overall use of MySpace at any period in time, not just the popularity of the artist. This is something that can be seen over most MySpace data.</p>
<p>Now let’s look at two artists who have even more similar plays per hour to each other:</p>
<div class="wp-caption alignnone" style="width: 570px"><a href="/wp-content/uploads/2009/12/dido_theclash.jpg"><img title="Dido and The Clash - MySpace Plays Per Hour" src="/wp-content/uploads/2009/12/dido_theclash.jpg" alt="Dido and The Clash - MySpace Plays Per Hour" width="560" height="420" /></a><p class="wp-caption-text">Dido and The Clash - MySpace Plays Per Hour</p></div>
<p>The Clash and Dido show very high similarity for plays per hour on MySpace over the time frame shown in the chart above. A lot of this will have to do with the overall use of MySpace at any period of time, and the fact that the two artists have not had a lot of activity during that period to make their play counts diverge from each other.</p>
<p>Finally, we&#8217;ll search for artists that show similar short term peaks to one other. In this case Muse was flagged as a high match for 50 Cent in September 2009, as is clear in the chart below:</p>
<div class="wp-caption alignnone" style="width: 570px"><a href="/wp-content/uploads/2009/12/50cent_muse.jpg"><img title="Muse and 50 Cent - MySpace Plays Per Hour" src="/wp-content/uploads/2009/12/50cent_muse.jpg" alt="Muse and 50 Cent - MySpace Plays Per Hour" width="560" height="420" /></a><p class="wp-caption-text">Muse and 50 Cent - MySpace Plays Per Hour</p></div>
<p>If we look at their discographies – we discover that both Muse and 50 Cent made a release on the same day in September.</p>
<p>We’ll investigate the different reasons why two artists might have similar trends to each other in another blog post, so check back soon!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.musicmetric.com/2009/12/analysing-trends-over-time-with-musicmetric/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

