Advice for the rank beginner part 2
Ok, so I had you go get the data before really talking about what you might do with it. Let's look at what we have.
article <has author> author1, author2,... authorn
authorn <has address> institution, city, state, country, zip code
article <cites> article1, article2,... articlen
... other citation stuff like journals, dates...
From this we can build a co-authorship network.
article1 <has author> author1
article1 <has author> author2
Therefore there is a connection author1 --- author2, meaning some similarity or communication between the two.
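If you like to see it in code, here is a minimal Python sketch of that pairing step. The sample data and the name coauthor_edges are made up for illustration; real data would come from whatever you exported. Every pair of authors on the same article gets a link, and the weight counts how many articles they share.

```python
from collections import Counter
from itertools import combinations

# Hypothetical sample data: article -> list of its authors.
articles = {
    "article1": ["author1", "author2"],
    "article2": ["author2", "author3", "author1"],
    "article3": ["author4"],
}

def coauthor_edges(article_authors):
    """Count how many articles each pair of authors has written together."""
    edges = Counter()
    for authors in article_authors.values():
        # sorted(set(...)) drops duplicate names and gives each pair a stable order.
        for a, b in combinations(sorted(set(authors)), 2):
            edges[(a, b)] += 1
    return edges

edges = coauthor_edges(articles)
for (a, b), weight in sorted(edges.items()):
    print(f"{a} --- {b} (weight {weight})")
```

A single-author article like article3 adds no links, so author4 does not show up in the network at all.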
You can extend this to institutions and countries.
article1 <has author> author1 and author1 <has address> institution1, country1
article1 <has author> author2 and author2 <has address> institution2, country2
Therefore there is a connection institution1 --- institution2 or country1 --- country2.
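The same idea in a rough Python sketch: swap each author for their institution or country, then pair up what is left. The data layout, the field names and the function affiliation_edges are all assumptions for the example, not anything a particular tool gives you.

```python
from collections import Counter
from itertools import combinations

# Hypothetical sample data.
article_authors = {
    "article1": ["author1", "author2"],
    "article2": ["author2", "author3"],
}
author_address = {
    "author1": {"institution": "institution1", "country": "country1"},
    "author2": {"institution": "institution2", "country": "country2"},
    "author3": {"institution": "institution2", "country": "country2"},
}

def affiliation_edges(article_authors, author_address, field):
    """Link the institutions (or countries) that appear together on an article."""
    edges = Counter()
    for authors in article_authors.values():
        # Replace each author by the chosen address field; skip authors with no address.
        places = {author_address[a][field] for a in authors if a in author_address}
        for p, q in combinations(sorted(places), 2):
            edges[(p, q)] += 1
    return edges

institution_edges = affiliation_edges(article_authors, author_address, "institution")
country_edges = affiliation_edges(article_authors, author_address, "country")
print(dict(institution_edges))
print(dict(country_edges))
```

One design choice to be aware of: two authors from the same institution on one article produce no link here, because the sketch only records connections between different places.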
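Citation links work the same way.
article1 <cites> article2
article3 <cites> article2
Therefore there might be some similarity between articles 1 and 3. Strictly speaking, two articles that cite the same thing are called bibliographically coupled. Co-citation is the mirror image:
article1 <cites> article2
article1 <cites> article3
Therefore there might be some similarity between articles 2 and 3.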
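Here is a small Python sketch that counts citation links in both directions: pairs of articles that cite the same reference, and pairs of references that turn up together in the same article's citation list. The sample data and both function names are hypothetical.

```python
from collections import Counter
from itertools import combinations

# Hypothetical sample data: citing article -> list of articles it cites.
cites = {
    "article1": ["article2", "article4"],
    "article3": ["article2", "article4", "article5"],
    "article6": ["article5"],
}

def shared_reference_counts(cites):
    """For each pair of citing articles, count the references they have in common."""
    counts = Counter()
    for a, b in combinations(sorted(cites), 2):
        shared = set(cites[a]) & set(cites[b])
        if shared:
            counts[(a, b)] = len(shared)
    return counts

def cited_together_counts(cites):
    """For each pair of cited articles, count the articles that cite both."""
    counts = Counter()
    for refs in cites.values():
        for r, s in combinations(sorted(set(refs)), 2):
            counts[(r, s)] += 1
    return counts

shared = shared_reference_counts(cites)
together = cited_together_counts(cites)
print(dict(shared))
print(dict(together))
```

The higher the count, the stronger the hint that two articles are about similar things. It is still only a hint.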
You can also treat each article's list of citations as a vector and then measure how similar two vectors are, using Euclidean distance or some other measure, but I haven't actually done that so I'll leave it for someone else to explain.
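For the curious, a minimal sketch of the vector idea in Python, under some assumptions: each article becomes a row of 0s and 1s (1 if it cites a given reference), and two common measures, Euclidean distance and cosine similarity, are computed by hand. The data and function names are made up, and this is one possible reading of the approach, not the way any particular package does it.

```python
import math

# Hypothetical sample data: citing article -> list of articles it cites.
cites = {
    "article1": ["article2", "article4"],
    "article3": ["article2", "article4", "article5"],
}

def citation_vectors(cites):
    """Turn each citation list into a 0/1 vector over all cited articles."""
    vocabulary = sorted({ref for refs in cites.values() for ref in refs})
    vectors = {}
    for article, refs in cites.items():
        cited = set(refs)
        vectors[article] = [1 if ref in cited else 0 for ref in vocabulary]
    return vocabulary, vectors

def euclidean_distance(u, v):
    """Smaller means more alike; 0 means identical citation lists."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))

def cosine_similarity(u, v):
    """Closer to 1 means more alike; 0 means no citations in common."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0

vocabulary, vectors = citation_vectors(cites)
distance = euclidean_distance(vectors["article1"], vectors["article3"])
similarity = cosine_similarity(vectors["article1"], vectors["article3"])
print(vocabulary)
print(distance, similarity)
```

Note that the two measures run in opposite directions: a small distance and a large cosine similarity both mean the articles cite much the same things.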
Still to come (and not tonight!)
- exporting from Sitkis
- importing into UCInet or NetDraw (comes with UCInet)
- drawing pretty pictures
- some simple measures
- using a citation manager to get co-authorship data
- where to go for more information (and people who know more about this than I ever will)