Knowledge Matters

Understanding knowledge relationships

graham durant-law's blog

Gloor’s Contribution Index Applied

spring diagramThis is the third post in a series that considers how I’m doing a large and complex social network analysis on a knowledge exchange organistion that has a world-wide presence. In my previous post I talked about data cleansing, and Gloor’s Contribution Index as a way of attributing data. In this post I’ll show you how I applied Gloor’s Contribution Index, but first let’s look at the adjacent network as a spring diagram. I’m sure you will agree it’s impossible to interpret from a visual inspect alone! I’ve used NodeXL to produce this map. I find that NodeXL is an excellent tool for the early analysis and to produce some insights that can be examined later.

The next map is also produced in NodeXL, but this time Gloor’s Contribution Index has been applied to data. The power of this diagram is it removes the “noise” from the network, and we can start to see some network behaviour.

Gloor Network 

In my previous post I provided an outline of how I’m doing a large and complex SNA on a knowledge exchange organistion that has a world-wide presence, a membership of about 2,500, and the use of multiple platforms. In this post I’ll talk about data cleansing, and Gloor’s Contribution Index as a way of attributing data.

The task is challenging not the least because I have 10 complete years of data, and two years of incomplete data to analyse. It’s also a daunting task, but I was able to quickly reduce the dataset to 10,576 rows and 7 columns which had to be cleaned and manipulated! My tool of choice for a dataset of this size, at least for the initial cleaning and manipulation stage, is Microsoft Excel. Excel has some very good capabilities including a =CLEAN command to remove non-printable hidden characters that cause problems in analysis tools.

The dataset contained 10,354 posts. 7,238 were reply posts. Of these "Anonymous" posted 1,999 replies to 1,374 posts. This represents about 18% of all posts. However, it was necessary to remove "Anonymous" from the dataset, because "Anonymous" is almost certainly not a single person, and to leave them in would distort the results. Similarly, identified pseudonyms, aliases, and duplicate names, along with “self-replies” and no answers were removed. Ultimately this process left 703 identified individuals in the network. These people comprise the node-set for the public bounded or contained network, for which activity and various network measures can be applied.

One of the first measures applied was Gloor’s Contribution Index (messages sent – messages received)/(messages sent + messages received). It is interpreted as follows:

Conducting a Complex SNA

complex networkI’m currently doing half a dozen social network analysis (SNA) studies. They all vary in size and complexity, which changes my approach and the tools I use. The most complex one involves a knowledge exchange organistion that has a world-wide presence, a membership of about 2,500, and the use of multiple platforms. It is an interesting but difficult study.

The purpose of the study is threefold, as follows:

first, to better understand the organisation in terms of issues such as identity, relationships, function and role;

second, to better understand the relationship of its special interest groups to each other and the main knowledge exchange community; and

third, to better understand where the main knowledge exchange community sits in comparison with other networks to help them appreciate where they are unique, where they are in terms of a traditional life-cycle, and how they might evolve.

I expect the outcomes and results of the study will result in recommendations for practical methods to support the growth and enrichment of the community: at least I hope so! So how will I go about it?

Copyright © 2004 -2012 Knowledge Matters™ - all rights reserved
The Webpages and Occasional Blog of Graham Durant-Law
E-mail: graham@durantlaw.info

Syndicate content

Clicky