Shifting sands: Multidimensional Scaling and Company Similarity

Monday, July 30, 2012

Multidimensional Scaling and Company Similarity

Background and idea

Often we are looking at a particular sector, and want to get a quick overview of a group of companies relative to one another. I thought I might apply Multidimensional Scaling (MDS) to various financial ratios and see if it gave us anything useful.

The premise is that companies in similar industries should all have a degree of sameness, so MDS might be useful to highlight the companies that stand out from the crowd, perhaps in some literal sense ...

Method

I mostly use the data functions from quantmod to retrieve the financial statements from Google Finance. As always with free data, the quality is variable, but good enough for our purpose today. We need to do a bit of dancing to get the market price at the time the results were released, and this uses data from Yahoo Finance. It was a little bit more work to implement, but worth it so we can include P/E in the comparison.

I looked at two groups of companies, tech stocks and financials/banks.

For the tech stocks I used ROE, EPS, P/E, Operating Margin, Current Ratio, Gearing, Asset Turnover and Debt Ratio. For the financials, I used ROE, EPS, P/E, Gearing and Debt Ratio, mainly because the data available did not have the line items required to calculate the other ratios.

The data from Google gives the last four periods, with the most recent coming first. It also gives Annual and Quarterly data and the charts below use the annual results. Annual Period 1 means the most recent results. Due to the scaling function, the actual scales on the graphs are not particularly meaningful, so I took them out.

Charts

These are the charts for the most recent results (so end of year 2011). Overall, I am quite pleased with the results. We can see how most of the companies cluster together, while a few seem to be quite different. This shows at a glance the companies that might be worthy of further investigation.

Tech Stocks

Financials

Outro

Code is up here MDS Company Similarity with R, it should hopefully be documented enough for others to mess around with. Any questions, comments or suggestions are very much appreciated as always.

As an aside, this is the first R program I wrote devoid of any for loops. I finally feel I am coming to grips with the language.

4 comments:

margareemanJuly 31, 2012 at 8:33 PM
In case it is of value: I receive this error (and I apologize, but I have not done any debugging ...)

sapply(symbols, get_prices, env=finEnv)
Error in as.Date.default(end_dates) :
do not know how to convert 'end_dates' to class "Date"

My Windows 7 install running in RStudio 0.96.304

R version 2.15.1 (2012-06-22) -- "Roasted Marshmallows"
Copyright (C) 2012 The R Foundation for Statistical Computing
ISBN 3-900051-07-0
Platform: x86_64-pc-mingw32/x64 (64-bit)
ReplyDelete
Replies
PeteAugust 3, 2012 at 3:41 AM
So I took a look and could reproduce it, turns out yahoo does not have RBS data going back to 2008. I should probably handle that better in the code, I'll take a look at it over the weekend. Thanks for letting me know!
ReplyDelete
Replies
James LiAugust 6, 2012 at 7:05 PM
Very interesting experiment! Would be more interesting if the dataset contains more companies, say 1000.
ReplyDelete
Replies

Add comment

Pages