Network Overview Discovery and Exploration for Excel 2007/2010
NodeXL provides support for social network analysis in the context of a spreadsheet.
See: http://www.codeplex.com/nodexl
NodeXL is a project from the Social Media Research Foundation and is a collaboration among:
- Connected Action Consulting Group
- Microsoft Research
- University of Maryland
- Cornell University
- Stanford University
- Oxford Internet Institute
NodeXL is the free and open add-in for Excel that supports network overview, discovery and exploration. The code and application can be found at http://www.codeplex.com/nodexl.
NodeXL requires Office 2007 or 2010. Other versions of Excel (like 2008 on Mac, or the older 2003) do not work with NodeXL (sorry!).
A video tutorial for NodeXL can be found at: http://www.connectedaction.net/2009/11/11/video-using-nodexl-to-map-the-digg-mentioning-twitter-population/
A manuscript tutorial guide to NodeXL can be found athttp://casci.umd.edu/images/4/46/NodeXL_tutorial_draft.pdf
A book Analyzing Social Media Networks with NodeXL: Insights from a connected world is available from Morgan-Kaufmann:

Analyzing Social Media Networks with NodeXL: Insights from a Connected World
Supporting data sets can be found at http://casci.umd.edu/NodeXL_Teaching.
Information about NodeXL can often be found on the Connected Action blog (http://www.connectedaction.net).
A recent slide deck describing NodeXL can be found at:http://www.slideshare.net/Marc_A_Smith/2009-december-nodexl-overview.
NodeXL allows for the import of network data in the form of edge lists, matricies, graphML, UCINet, and Pajek files along with CSV and other workbooks.
NodeXL allows non-programmers to quickly generate useful network statistics and metrics and create visualizations of network graphs. Filtering and display attributes can be used to highlight important structures in the network.
NodeXL supports the exploration of social media with import features that pull data from personal email indexes on the desktop, twitter, flickr, youtube, and, soon, facebook and WWW hyperlinks.
Recent features added to NodeXL include faster metrics calculation, larger data sets, new layouts, scales, axes, and legends.
NodeXL has been downloaded more than 64,000 times and is becoming the easiest path to getting insights from network data.
Social Media Network Research Related Publications
In the Journal of Social Structure: “Visualizing the Signatures of Social Roles in Online Discussion Groups” is available from: http://www.cmu.edu/joss/content/articles/volume8/Welser/It illustrates different patterns of network structures associated with different kinds of roles and behaviors.
Abstract: Social roles in online discussion forums can be described by patterned characteristics of communication between network members which we conceive of as ‘structural signatures.’ This paper uses visualization methods to reveal these structural signatures and regression analysis to confirm the relationship between these signatures and their associated roles in Usenet newsgroups. Our analysis focuses on distinguishing the signatures of one role from others, the role of “answer people.” Answer people are individuals whose dominant behavior is to respond to questions posed by other users. We found that answer people predominantly contribute one or a few messages to discussions initiated by others, are disproportionately tied to relative isolates, have few intense ties and have few triangles in their local networks. OLS regression shows that these signatures are strongly correlated with role behavior and, in combination, provide a strongly predictive model for identifying role behavior (R2=.72). To conclude, we consider strategies for further improving the identification of role behavior in online discussion settings and consider how the development of a taxonomy of author types could be extended to a taxonomy of newsgroups in particular and discussion systems in general.
“Discussion catalysts in online political discussions: Content importers and conversation starters“ in the Journal of Computer-Mediated Communication (JCMC) http://jcmc.indiana.edu/athttp://ping.fm/7NF5T
Abstract: This study addresses 3 research questions in the context of online political discussions: What is the distribution of successful topic starting practices, what characterizes the content of large thread-starting messages, and what is the source of that content? A 6-month analysis of almost 40,000 authors in 20 political Usenet newsgroups identified authors who received a disproportionate number of replies. We labeled these authors ‘‘discussion catalysts.’’ Content analysis revealed that 95 percent of discussion catalysts’ messages contained content imported from elsewhere on the web, about 2/3 from traditional news organizations. We conclude that the flow of information from the content creators to the readers and writers continues to be mediated by a few individuals who act as filters and amplifiers.
Smith, M., Shneiderman, B., Milic-Frayling, N., Rodrigues, E.M., Barash, V., Dunne, C., Capone, T., Perer, A. & Gleave, E. (2009),”Analyzing (Social Media) Networks with NodeXL“, In C&T ’09: Proceedings of the Fourth International Conference on Communities and Technologies. Springer.
Abstract: In this paper we present NodeXL, an extendible toolkit for network data analysis and visualization, implemented as an add-in to the Microsoft Excel 2007 spreadsheet software. We demonstrate NodeXL features through analysis of a data sample drawn from an enterprise intranet social network, discussion, and wiki. Through a sequence of steps we show how NodeXL leverages and extends the broadly used spreadsheet paradigm to support common operations in network analysis. This ranges from data import to computation of network statistics and refinement of network visualization through a selection of ready-to-use sorting, filtering, and clustering functions.
Howard Welser, Eric Gleave, Marc Smith, Vladimir Barash, Jessica Meckes. “Whither the Experts? Social affordances and the cultivation of experts in community Q&A systems”, in SIN ’09: Proc. international symposium on Social Intelligence and Networking. IEEE Computer Society Press.
Abstract: Community based Question and Answer systems have been promoted as web 2.0 solutions to the problem of finding expert knowledge. This promise depends on systems’ capacity to attract and sustain experts capable of offering high quality, factual answers. Content analysis of dedicated contributors’ messages in the Live QnA system found: (1) few contributors who focused on providing technical answers (2) a preponderance of attention paid to opinion and discussion, especially in non-technical threads. This paucity of experts raises an important general question: how do the social affordances of a site alter the ecology of roles found there? Using insights from recent research in online community, we generate a series of expectations about how social affordances are likely to alter the role ecology of online systems.
Bonsignore, E.M., Dunne, C., Rotman, D., Smith, M., Capone, T., Hansen, D.L. & Shneiderman, B. (2009), ”First steps to NetViz Nirvana: evaluating social network analysis with NodeXL“, In SIN ’09: Proc. international symposium on Social Intelligence and Networking. IEEE Computer Society Press.
Abstract: Social Network Analysis (SNA) has evolved as a popular, standard method for modeling meaningful, often hidden structural relationships in communities. Existing SNA tools often involve extensive pre-processing or intensive programming skills that can challenge practitioners and students alike. NodeXL, an open-source template for Microsoft Excel, integrates a library of common network metrics and graph layout algorithms within the familiar spreadsheet format, offering a potentially low-barrier to-entry framework for teaching and learning SNA. We present the preliminary findings of 2 user studies of 21 graduate students who engaged in SNA using NodeXL. The majority of students, while information professionals, had little technical background or experience with SNA techniques. Six of the participants had more technical backgrounds and were chosen specifically for their experience with graph drawing and information visualization. Our primary objectives were (1) to evaluate NodeXL as an SNA tool for a broad base of users and (2) to explore methods for teaching SNA. Our complementary dual case-study format demonstrates the usability of NodeXL for a diverse set of users, and significantly, the power of a tightly integrated metrics/visualization tool to spark insight and facilitate sensemaking for students of SNA.
Hansen, D., Rotman, D., Bonsignore, E., Milic-Frayling, N., Rodrigues, E., Smith, M., Shneiderman, B. (July 2009)
Do You Know the Way to SNA?: A Process Model for Analyzing and Visualizing Social Media Data
University of Maryland Tech Report: HCIL-2009-17
Abstract: Voluminous online activity data from users of social media can shed light on individual behavior, social relationships, and community efficacy. However, tools and processes to analyze this data are just beginning to evolve. We studied 15 graduate students who were taught to use NodeXL to analyze social media data sets. Based on these observations, we present a process model of social network analysis (SNA) and visualization, then use it to identify stages where intervention from peers, experts, and computational aids are most useful. We offer implications for designers of SNA tools, educators, and community & organizational analysts.



































3 responses so far ↓
1 AndreLuizJPB (André Luiz) // Jun 28, 2010 at 5:34 pm
@raquelrecuero Vi você na rede exemplo do site do NodeXL. http://www.connectedaction.net/nodexl/
2 Marketta Ray // Dec 7, 2011 at 10:35 am
Greetings. I work for an external evaluation company, and we are needing to use a social networking analysis tool. Your product has caught our eye. I have a few questions that I hope you can take the time to help me with.
1) What format does the data need to be in, in order to do the analysis?
We currently use survey software where we can send out to the participants to get data.
Thank you for your time and I’m looking forward to hearing from you.
Marketta Ray
East Main Educational Consulting, LLC
3 Marc Smith // Dec 7, 2011 at 12:52 pm
NodeXL consumes multiple formats:
_Importing Graph Data
You can import graph data into a NodeXL workbook from a variety of sources in a variety of formats.
Imported graph data normally overwrites any graph data that is already in the workbook, but you can change this so that imported data gets appended to existing data instead. Appending graph data can lead to confusing results—in particular, you can end up with multiple rows in the Vertices worksheet for the same vertex—so this option is intended for advanced users only.
To tell NodeXL to append imported graph data to existing data:
In the Ribbon, select NodeXL, Data, Import.
Uncheck Clear NodeXL Workbook First.
_Importing Graph Data from Other Programs
NodeXL can import graph data in a number of file formats that are used by other graph programs, so that you can, for example, create a graph in UCINET and then import and view the graph in NodeXL. The supported formats are shown in the table below.
File Format Description
UCINET Full Matrix DL This is the only UCINET file format that can be imported into NodeXL. If you have a UCINET file in a different format, such as nodelist1, rankedlist1 or dataset, select the “What if my UCINET file is not in full matrix DL format” link in the Import from UCINET Full Matrix DL File dialog box for instructions.
Importing a UCINET full matrix DL file into NodeXL adds an Edge Weight column to the Edges worksheet.
GraphML GraphML is an industry-standard graph file format supported by a number of graph programs.
GraphML supports arbitrary vertex and edge attributes. When importing a GraphML file, NodeXL adds a column to the Vertices or Edges worksheet for each attribute in the file.
Pajek NodeXL can import files created by the Pajek program. Importing a Pajek file adds an Edge Weight column to the Edges worksheet. Any other edge or vertex attributes in the Pajek file are ignored.
To import graph data from another program:
In the Ribbon, select NodeXL, Data, Import.
Select from the second group of items on the Import menu.
_Importing Graph Data from Another Workbook
You can import graph data that is stored in another open Excel workbook in either matrix format or as an edge list. In either case, the other workbook must already be opened in Excel; NodeXL will not open it for you.
To import graph data from an Excel workbook that contains a matrix:
In the Ribbon, select NodeXL, Data, Import, From Open Matrix Workbook.
Follow the instructions in the Import from Open Matrix Workbook dialog box.
To import graph data from an Excel workbook that contains an edge list:
In the Ribbon, select NodeXL, Data, Import, From Open Workbook.
Follow the instructions in the Import from Open Workbook dialog box.
_Importing Graph Data from Online Social Networks
NodeXL can analyze social networks on Twitter, Flickr and YouTube, and then import the results as graph data into the NodeXL workbook. You can, for example, import a Twitter network of people whose tweets contain a specified hashtag, or a Flickr network of the people who have commented on someone’s photos, or the YouTube network of videos that are tagged with a specified keyword.
Because these features access external services to obtain their data, there are a few restrictions involved in their use. For example, the Twitter features will work if you use them anonymously, but they will work faster if you authorize NodeXL to use your Twitter account to obtain the Twitter data. And the Flickr features require something called a Flickr API key, which you can obtain from Flickr. These restrictions are explained within the features’ dialog boxes.
To analyze an online social network and import the results as graph data:
In the Ribbon, select NodeXL, Data, Import.
Select from the second-to-last group of items on the Import menu.
_Importing Graph Data from Email
If you use a desktop-based email program such as Outlook, Windows Mail or Outlook Express on Windows 7 or Vista, you can tell NodeXL to analyze the relationships among the people you communicate with via email, and then import the results as graph data into the NodeXL workbook. If you communicated 117 times with Bill via email, for example, the graph will include vertices for you and Bill, with a connecting edge that has an Edge Weight of 117.
NodeXL will not analyze Web-based email.
If you use Windows XP, you have to install Windows Search before NodeXL will analyze your email.
To analyze your email and import the results as graph data:
In the Ribbon, select NodeXL, Data, Import, From Email Network.
Select options in the Import from Email Network dialog box.
To obtain more information about how NodeXL analyzes email:
In the Import from Email Network dialog box, select the “How email is analyzed and imported” link.
Leave a Comment