Kardashian index plot for data available in the Brede Wiki

Around 90 researchers listed in the Brede Wiki have both an Google Scholar identifier and and Twitter identifier associated with them. These can be extracted and Google Scholar and Twitter can be queried to get citation count and followers count so the Kardashian index (K-index) can be computed.

See the plot and the Kardashian index at the bottom.

The list of the included researchers is here: http://neuro.compute.dtu.dk/wiki/Category:Researchers_in_Google_Scholar_and_Twitter

Read more about the Kardashian index here: http://neuro.compute.dtu.dk/wiki/Kardashian_index

New coefficients for power law for Google Scholar/Twitter are computed. The power law is 50.5 x citations ^ 0.28.

To run the below code you need to setup a python.cfg configuration file with the four Twitter tokens, user-agent and 'from' email.

In [2]:
from __future__ import division, print_function
try:
    import ConfigParser as configparser
except ImportError:
    import configparser
import copy
import json
import matplotlib
import matplotlib.pyplot as plt
import numpy as np
import os
import pandas as pd
import re
import requests
import scipy.spatial
import time
import tweepy
from urllib import quote
In [2]:
# Read configuration file for request user-agent and Twitter tokens
config = configparser.ConfigParser()
config.read(os.path.expanduser('~/etc/python.cfg'))

headers = {
    'User-Agent': config.get('requests', 'user_agent'),
    'From': config.get('requests', 'from')
}
In [3]:
# Fetch name of Brede Wiki page with researchers on Google Scholar and Twitter
url_gst = ('http://neuro.compute.dtu.dk/w/api.php?'
           'action=query&format=json&list=categorymembers&'
           'cmtitle=Category:Researchers in Google Scholar and Twitter')

response = requests.get(url_gst, headers=headers).json()
pagetitles = [page['title'] for page in response['query']['categorymembers']]
while 'query-continue' in response:
    url_continue = url_gst + '&cmcontinue=' + response['query-continue']['categorymembers']['cmcontinue'].encode('utf-8')
    response = requests.get(url_continue).json()
    pagetitles.extend([page['title'] for page in response['query']['categorymembers']])
    
In [4]:
print(pagetitles)
[u'Aaron Clauset', u'Aaron Halfaker', u'Alan Mislove', u'Alan Ruttenberg', u'Albert Gjedde', u'Alex O. Holcombe', u'Alex Verstak', u'Alon Halevy', u'Amy Bruckman', u'Anders Stockmarr', u'Andrea Cuttone', u'Andrea Forte', u'Andreas Roepstorff', u'Andrius Butkus', u'Antonia F. de C. Hamilton', u'Arkadiusz Stopczynski', u'Arto Lanam\xe4ki', u'Benjamin Mako Hill', u'Bjarne \xd8rum Fruergaard', u'Brian Keegan', u'C. Lee Giles', u'Carsten Stahlhut', u'Chitu Okoli', u'Chris D. Frith', u'Christian Kohlsch\xfctter', u'Cliff Lampe', u'Cyril Goutte', u'Dan Jurafsky', u'Daniel M. Romero', u'Daniel Mietchen', u'Dario Taraborelli', u'David D. Cox', u'David Laniado', u'David M. Blei', u'David M. Pennock', u'David N. Kennedy', u'Denny Vrande\u010di\u0107', u'Dirk Riehle', u'Dominik Benz', u'Ed H. Chi', u'Edward T. Bullmore', u'Egon Willighagen', u'Elanor Colleoni', u'Emilio Ferrara', u'Eszter Hargittai', u'Felipe Ortega', u'Finn \xc5rup Nielsen', u'Frank G. A. de Bakker', u'Friederike Schultz', u'Ga\xebl Varoquaux', u'Geoffrey Karl Aguirre', u'Geraint Rees', u'Haewoon Kwak', u'Henry Markram', u'J. Andreas B\xe6rentzen', u'J. Douglas Bremner', u'Jakob Eg Larsen', u'James W. Pennebaker', u'Jan Larsen (DTU Compute)', u'Jens Madsen', u'Joseph R. Kiniry', u'Jure Leskovec', u'Kasper Winther Andersen', u'Katrin Weller', u'Lada A. Adamic', u'Lars Kai Hansen', u'Lars Ramkilde Knudsen', u'Lasse L. M\xf8lgaard', u'Leon French', u'Luca de Alfaro', u'Magnus Manske', u'Marcus Messner', u'Maria Grineva', u'Matthew G. Liptrot', u'Michael Cook', u'Michael Kai Petersen', u'Mikkel N. Schmidt', u'Mikkel Wallentin', u'Morten M\xf8rup', u'Nikola Milikic', u'Noshir S. Contractor', u'Olaf Sporns', u'Paolo Massa', u'Paul Groth', u'Peter Sheridan Dodds', u'Peter Vuust', u'Petter Holme', u'Phoebe Ayers', u'Piotr Konieczny', u'R. Stuart Geiger', u'Rada Mihalcea', u'Radu Dr\u0103gu\u0219in', u'Rapha\xebl Troncy', u'Renaud Lambiotte', u'Roderick Murray-Smith', u'Russell A. Poldrack', u'Ryota Kanai', u'Ryota Tomioka', u'Sarita Yardi', u'Simon Due Kamronn', u'Stanislas Dehaene', u'Stefania Serafin', u'Stephan Gouws', u'Sue B. Moon', u'Sune Lehmann', u'Taha Yasseri', u'Tal Yarkoni', u'Thomas Steiner', u'Tobias S. Andersen', u'Tom M. Mitchell', u'Trine Julie Abrahamsen', u'Uta Frith', u'Vedran Sekara', u'Vibe Geds\xf8 Fr\xf8kj\xe6r', u'Vince Calhoun', u'Wray Buntine', u'Yong-Yeol Ahn']
In [5]:
# Get researcher data from Brede Wiki
url_pages = "http://neuro.compute.dtu.dk/w/index.php?action=raw&title="
pattern_researcher = re.compile('{{Researcher(\s*?\|.*?)}}', 
                           re.DOTALL | re.IGNORECASE | re.UNICODE)
pattern_fields = re.compile(r'\s*\|\s*(\w+)\s*=\s*([^\|]*\w)', 
                            re.DOTALL | re.UNICODE)

researchers = []
for pagetitle in pagetitles:
    response = requests.get(url_pages + quote(pagetitle.encode('utf-8')), headers=headers)
    print(pagetitle)
    researcher = pattern_researcher.findall(response.text)
    if researcher:
        researchers.append(dict(pattern_fields.findall(researcher[0])))
    else:
        print("Problem with " + pagetitle)
Aaron Clauset
Aaron Halfaker
Alan Mislove
Alan Ruttenberg
Albert Gjedde
Alex O. Holcombe
Alex Verstak
Alon Halevy
Amy Bruckman
Anders Stockmarr
Andrea Cuttone
Andrea Forte
Andreas Roepstorff
Andrius Butkus
Antonia F. de C. Hamilton
Arkadiusz Stopczynski
Arto Lanamäki
Benjamin Mako Hill
Bjarne Ørum Fruergaard
Brian Keegan
C. Lee Giles
Carsten Stahlhut
Chitu Okoli
Chris D. Frith
Christian Kohlschütter
Cliff Lampe
Cyril Goutte
Dan Jurafsky
Daniel M. Romero
Daniel Mietchen
Dario Taraborelli
David D. Cox
David Laniado
David M. Blei
David M. Pennock
David N. Kennedy
Denny Vrandečić
Dirk Riehle
Dominik Benz
Ed H. Chi
Edward T. Bullmore
Egon Willighagen
Elanor Colleoni
Emilio Ferrara
Eszter Hargittai
Felipe Ortega
Finn Årup Nielsen
Frank G. A. de Bakker
Friederike Schultz
Gaël Varoquaux
Geoffrey Karl Aguirre
Geraint Rees
Haewoon Kwak
Henry Markram
J. Andreas Bærentzen
J. Douglas Bremner
Jakob Eg Larsen
James W. Pennebaker
Jan Larsen (DTU Compute)
Jens Madsen
Joseph R. Kiniry
Jure Leskovec
Kasper Winther Andersen
Katrin Weller
Lada A. Adamic
Lars Kai Hansen
Lars Ramkilde Knudsen
Lasse L. Mølgaard
Leon French
Luca de Alfaro
Magnus Manske
Marcus Messner
Maria Grineva
Matthew G. Liptrot
Michael Cook
Michael Kai Petersen
Mikkel N. Schmidt
Mikkel Wallentin
Morten Mørup
Nikola Milikic
Noshir S. Contractor
Olaf Sporns
Paolo Massa
Paul Groth
Peter Sheridan Dodds
Peter Vuust
Petter Holme
Phoebe Ayers
Piotr Konieczny
R. Stuart Geiger
Rada Mihalcea
Radu Drăgușin
Raphaël Troncy
Renaud Lambiotte
Roderick Murray-Smith
Russell A. Poldrack
Ryota Kanai
Ryota Tomioka
Sarita Yardi
Simon Due Kamronn
Stanislas Dehaene
Stefania Serafin
Stephan Gouws
Sue B. Moon
Sune Lehmann
Taha Yasseri
Tal Yarkoni
Thomas Steiner
Tobias S. Andersen
Tom M. Mitchell
Trine Julie Abrahamsen
Uta Frith
Vedran Sekara
Vibe Gedsø Frøkjær
Vince Calhoun
Wray Buntine
Yong-Yeol Ahn
In [6]:
researchers[14]
Out[6]:
{u'affiliation1': u'Institute of Cognitive Neuroscience',
 u'affiliation2': u'School of Psychology, University of Nottingham',
 u'affiliation3': u'Sobell Department of Motor Neuroscience and Movement Disorders',
 u'country': u'United Kingdom',
 u'family_name': u'Hamilton',
 u'googlescholar': u'vNqtGKcAAAAJ',
 u'initials': u'AF',
 u'name': u'Antonia Hamilton',
 u'position': u'Senior Lecturer',
 u'scopus': u'7202618686',
 u'scopus1': u'38060964400',
 u'twitter': u'antoniahamilton',
 u'url1': u'http://www.antoniahamilton.com/people.html'}
In [7]:
url_gs = 'http://scholar.google.com/citations?user='

headers = {
    'User-Agent': config.get('requests', 'user_agent'),
    'From': config.get('requests', 'from')
}

# <td class="cit-borderleft cit-data">2537</td>
pattern_gscount = re.compile('<td class="gsc_rsb_std">(\d+)</td>')

def get_google_scholar_counts(google_scholar_id):
    response = requests.get(url_gs + google_scholar_id, 
                            headers=headers)
    counts = dict(zip(['citations', 'citations5', 'h-index', 'h-index5',
                        'i10-index', 'i10-index5'],
                        map(int, pattern_gscount.findall(response.text))))
    return counts
    
In [8]:
# Yong-Yeol Ahn check
get_google_scholar_counts('US7OSNgAAAAJ')
Out[8]:
{'citations': 3379,
 'citations5': 3181,
 'h-index': 16,
 'h-index5': 16,
 'i10-index': 18,
 'i10-index5': 17}
In [9]:
# Get data from Google Scholar
for researcher in researchers:
    if 'citations' not in researcher:
        print(researcher['name'])
        researcher.update(get_google_scholar_counts(researcher['googlescholar']))
        time.sleep(5)  
    
Aaron Clauset
Aaron Halfaker
Alan Mislove
Alan Ruttenberg
Albert Gjedde
Alex O. Holcombe
Alex Verstak
Alon Halevy
Amy Bruckman
Anders Stockmarr
Andrea Cuttone
Andrea Forte
Andreas Roepstorff
Andrius Butkus
Antonia Hamilton
Arkadiusz Stopczynski
Arto Lanamäki
Benjamin Mako Hill
Bjarne Ørum Fruergaard
Brian Keegan
C. Lee Giles
Carsten Stahlhut
Chitu Okoli
Chris D. Frith
Christian Kohlschütter
Cliff Lampe
Cyril Goutte
Dan Jurafsky
Daniel M. Romero
Daniel Mietchen
Dario Taraborelli
David D. Cox
David Laniado
David M. Blei
David M. Pennock
David N. Kennedy
Denny Vrandečić
Dirk Riehle
Dominik Benz
Ed H. Chi
Edward T. Bullmore
Egon Willighagen
Elanor Colleoni
Emilio Ferrara
Eszter Hargittai
Felipe Ortega
Finn Årup Nielsen
Frank G. A. de Bakker
Friederike Schultz
Gaël Varoquaux
Geoffrey Karl Aguirre
Geraint Rees
Haewoon Kwak
Henry Markram
J. Andreas Bærentzen
J. Douglas Bremner
Jakob Eg Larsen
James W. Pennebaker
Jan Larsen
Jens Madsen
Joseph R Kiniry
Jure Leskovec
Kasper Winther Andersen
Katrin Weller
Lada A. Adamic
Lars Kai Hansen
Lars Ramkilde Knudsen
Lasse L. Mølgaard
Leon French
Luca de Alfaro
Heinrich Magnus Manske
Marcus Messner
Maria Grineva
Matthew G. Liptrot
Michael Cook
Michael Kai Petersen
Mikkel N. Schmidt
Mikkel Wallentin
Morten Mørup
Nikola Milikic
Noshir S. Contractor
Olaf Sporns
Paolo Massa
Paul Groth
Peter Sheridan Dodds
Peter Vuust
Petter Holme
Phoebe Ayers
Piotr Konieczny
R. Stuart Geiger
Rada Mihalcea
Radu Drăgușin
Raphaël Troncy
Renaud Lambiotte
Roderick Murray-Smith
Russell A. Poldrack
Ryota Kanai
Ryota Tomioka
Sarita Yardi Schoenebeck
Simon Due Kamronn
Stanislas Dehaene
Stefania Serafin
Stephan Gouws
Sue B. Moon
Sune Lehmann
Taha Yasseri
Tal Yarkoni
Thomas Steiner
Tobias S. Andersen
Tom M. Mitchell
Trine Julie Abrahamsen
Uta Frith
Vedran Sekara
Vibe Gedsø Frøkjær
Vince Calhoun
Wray Buntine
Yong-Yeol Ahn
In [10]:
# Saving just in case
json.dump(researchers, open('researchers.json', 'w'))
In [11]:
# Twitter authentication
auth = tweepy.OAuthHandler(config.get('twitter', 'consumer_key'),
                           config.get('twitter', 'consumer_secret'))
auth.set_access_token(config.get('twitter', 'access_token'),
                      config.get('twitter', 'access_secret'))
In [12]:
# Function to download data from Twitter profiles
api = tweepy.API(auth)

def get_twitter_count(twitter_id):
    try:
        user = api.get_user(twitter_id)
        counts = {
            'Followers count': user.followers_count,
            'Friends count': user.friends_count,
            'Statuses count': user.statuses_count
            }
        return counts
    except Exception:
        print('Problem with ' + twitter_id)
        return {}
In [13]:
# Testing with Finn Aarup Nielsen (fnielsen)
get_twitter_count('fnielsen')
Out[13]:
{'Followers count': 411, 'Friends count': 351, 'Statuses count': 2569}
In [14]:
# Download data from Twitter
for researcher in researchers:
    researcher.update(get_twitter_count(researcher['twitter']))
    print(researcher['name'])
Aaron Clauset
Aaron Halfaker
Alan Mislove
Alan Ruttenberg
Albert Gjedde
Alex O. Holcombe
Alex Verstak
Alon Halevy
Amy Bruckman
Anders Stockmarr
Andrea Cuttone
Andrea Forte
Andreas Roepstorff
Andrius Butkus
Antonia Hamilton
Arkadiusz Stopczynski
Arto Lanamäki
Benjamin Mako Hill
Bjarne Ørum Fruergaard
Brian Keegan
C. Lee Giles
Carsten Stahlhut
Chitu Okoli
Chris D. Frith
Christian Kohlschütter
Cliff Lampe
Cyril Goutte
Dan Jurafsky
Daniel M. Romero
Daniel Mietchen
Dario Taraborelli
David D. Cox
David Laniado
David M. Blei
David M. Pennock
David N. Kennedy
Denny Vrandečić
Dirk Riehle
Dominik Benz
Ed H. Chi
Edward T. Bullmore
Egon Willighagen
Elanor Colleoni
Emilio Ferrara
Eszter Hargittai
Felipe Ortega
Finn Årup Nielsen
Frank G. A. de Bakker
Friederike Schultz
Gaël Varoquaux
Geoffrey Karl Aguirre
Geraint Rees
Haewoon Kwak
Henry Markram
J. Andreas Bærentzen
J. Douglas Bremner
Jakob Eg Larsen
James W. Pennebaker
Jan Larsen
Jens Madsen
Joseph R Kiniry
Jure Leskovec
Kasper Winther Andersen
Katrin Weller
Lada A. Adamic
Lars Kai Hansen
Lars Ramkilde Knudsen
Lasse L. Mølgaard
Leon French
Luca de Alfaro
Heinrich Magnus Manske
Marcus Messner
Maria Grineva
Matthew G. Liptrot
Michael Cook
Michael Kai Petersen
Mikkel N. Schmidt
Mikkel Wallentin
Morten Mørup
Nikola Milikic
Noshir S. Contractor
Olaf Sporns
Paolo Massa
Paul Groth
Peter Sheridan Dodds
Peter Vuust
Petter Holme
Phoebe Ayers
Piotr Konieczny
R. Stuart Geiger
Rada Mihalcea
Radu Drăgușin
Raphaël Troncy
Renaud Lambiotte
Roderick Murray-Smith
Russell A. Poldrack
Ryota Kanai
Ryota Tomioka
Sarita Yardi Schoenebeck
Simon Due Kamronn
Stanislas Dehaene
Stefania Serafin
Stephan Gouws
Sue B. Moon
Sune Lehmann
Taha Yasseri
Tal Yarkoni
Thomas Steiner
Tobias S. Andersen
Tom M. Mitchell
Trine Julie Abrahamsen
Uta Frith
Vedran Sekara
Vibe Gedsø Frøkjær
Vince Calhoun
Wray Buntine
Yong-Yeol Ahn
In [15]:
# Save just in case
json.dump(researchers, open('researchers.json', 'w'))
In [3]:
researchers = json.load(open('researchers.json'))
In [4]:
researchers[0]
Out[4]:
{u'Followers count': 1733,
 u'Friends count': 95,
 u'Statuses count': 575,
 u'affiliation1': u'University of Colorado at Boulder',
 u'citations': 8346,
 u'citations5': 7446,
 u'country': u'United States of America',
 u'family_name': u'Clauset',
 u'googlescholar': u'e7VI_HcAAAAJ',
 u'h-index': 20,
 u'h-index5': 18,
 u'i10-index': 28,
 u'i10-index5': 27,
 u'initials': u'A',
 u'name': u'Aaron Clauset',
 u'position': u'Assistant Professor',
 u'scopus': u'8298347800',
 u'twitter': u'aaronclauset',
 u'url1': u'http://tuvalu.santafe.edu/~aaronc',
 u'url2': u'http://cs.unm.edu/~aaron/blog',
 u'videolectures': u'aaron_clauset'}
In [5]:
# Pandas!
df = pd.DataFrame(researchers)
In [6]:
%matplotlib inline
In [17]:
# isnan: Houston, we've had a problem
indices = (~np.isnan(df['citations'])) & (df['citations'] != 0)
reverse_index = indices[indices].index.values

# Plot the data
matplotlib.rc('font', family='DejaVu Sans')
fig = df.plot(x='citations', y='Followers count',
                       kind='scatter', figsize=(15, 10), 
                       marker='*', s=df['Statuses count']/10,
                       linewidth=2, color=(0.8, 0.8, 0.8))
ax = plt.gca()
ax.set_xscale('log')
ax.set_yscale('log')
plt.xlabel('Google Scholar citations')
plt.ylabel('Twitter followers count')
plt.title('Kardashian index for Brede Wiki researchers on Google Scholar and Twitter')

# Power law fit
p = np.polyfit(np.log(df.ix[indices, 'citations']), np.log(df.ix[indices, 'Followers count']), 1)
powerlaw = np.frompyfunc(lambda x: np.exp(p[1]) * x ** p[0], 1, 1)
plt.plot([1, 200000], powerlaw([1, 200000]), linewidth=5, color=(0.5, 1, 0.5))
plt.text(10, 5000, '{:.3} x citations^{:0.2}'.format(np.exp(p[1]), p[0]), fontsize=20)

# Annotation of some of the points with researcher names
hull = scipy.spatial.ConvexHull(df.ix[indices, ['citations', 'Followers count']])
for index in hull.vertices:
    x, y, name = df.ix[reverse_index[index], ['citations', 'Followers count', 'name']].values
    try:
        plt.text(x, y, name, horizontalalignment='center', verticalalignment='center')
    except:
        pass
    
# Myself and Ryoto and et al.
family_names = ['Nielsen', 'Tomioka', 'Willighagen']
for family_name in family_names:
    x, y, name = df.ix[df['family_name'] == family_name, ['citations', 'Followers count', 'name']].values.flatten()
    plt.text(x, y, name, horizontalalignment='center', verticalalignment='center')

dummy = plt.axis((1, 200000, 1, 20000))
plt.show()
In [18]:
df.describe()
Out[18]:
Followers count Friends count Statuses count citations citations5 h-index h-index5 i10-index i10-index5
count 117.000000 117.000000 117.000000 117.000000 117.000000 117.000000 117.000000 117.000000 117.000000
mean 1388.213675 374.042735 2564.042735 7694.487179 4507.735043 27.393162 22.649573 56.606838 47.188034
std 2407.753202 673.039777 4933.751046 15717.475907 7870.026290 29.289859 21.741156 82.538910 67.952561
min 1.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000
25% 73.000000 52.000000 40.000000 348.000000 308.000000 9.000000 9.000000 9.000000 8.000000
50% 504.000000 189.000000 551.000000 1815.000000 1393.000000 17.000000 16.000000 24.000000 20.000000
75% 1733.000000 435.000000 3053.000000 7424.000000 5018.000000 34.000000 30.000000 66.000000 57.000000
max 15224.000000 5915.000000 31695.000000 124042.000000 51905.000000 179.000000 117.000000 522.000000 415.000000

8 rows × 9 columns

In [19]:
df['K-index'] = df['Followers count'] / powerlaw(df['citations'])
In [20]:
# Identify the 'scientific Kardashians'
high_score = df[indices].sort(columns='K-index', ascending=False)[['name', 'K-index', 'Statuses count']]
high_score
Out[20]:
name K-index Statuses count
104 Sune Lehmann 43.35202 1994
43 Emilio Ferrara 29.38693 4680
61 Jure Leskovec 14.17319 575
111 Uta Frith 13.85914 3760
19 Brian Keegan 13.52433 17571
74 Michael Cook 13.20721 24813
71 Marcus Messner 10.85944 3086
44 Eszter Hargittai 10.48633 3050
29 Daniel Mietchen 10.21248 18336
72 Maria Grineva 10.20603 15734
64 Lada A. Adamic 9.837668 278
98 Sarita Yardi Schoenebeck 8.830271 15314
55 J. Douglas Bremner 7.902007 9638
107 Thomas Steiner 7.387838 6546
49 Gaël Varoquaux 7.323572 1326
39 Ed H. Chi 7.263597 7197
30 Dario Taraborelli 7.039645 4027
106 Tal Yarkoni 6.59077 2586
51 Geraint Rees 6.533561 1246
87 Phoebe Ayers 6.198842 2682
116 Yong-Yeol Ahn 6.043176 8292
17 Benjamin Mako Hill 5.815526 197
80 Noshir S. Contractor 5.768472 899
5 Alex O. Holcombe 5.398742 6329
41 Egon Willighagen 5.352792 31695
89 R. Stuart Geiger 5.329288 3053
8 Amy Bruckman 4.750928 3101
14 Antonia Hamilton 4.601453 1778
63 Katrin Weller 4.519743 3682
37 Dirk Riehle 4.273715 4747
92 Raphaël Troncy 4.229026 5685
25 Cliff Lampe 3.944443 3606
56 Jakob Eg Larsen 3.93527 2468
31 David D. Cox 3.797303 2709
105 Taha Yasseri 3.666922 723
96 Ryota Kanai 3.658165 2230
27 Dan Jurafsky 3.499375 68
15 Arkadiusz Stopczynski 3.370665 4556
95 Russell A. Poldrack 3.091737 551
16 Arto Lanamäki 3.045272 1050
0 Aaron Clauset 2.984011 575
36 Denny Vrandečić 2.847603 1250
45 Felipe Ortega 2.768661 1733
47 Frank G. A. de Bakker 2.726936 4753
11 Andrea Forte 2.69998 1428
83 Paul Groth 2.444785 4791
103 Sue B. Moon 2.39438 3346
7 Alon Halevy 2.314019 328
23 Chris D. Frith 2.212742 107
52 Haewoon Kwak 2.030797 11376
81 Olaf Sporns 1.851942 483
24 Christian Kohlschütter 1.684115 5203
70 Heinrich Magnus Manske 1.650236 2723
34 David M. Pennock 1.610894 834
86 Petter Holme 1.593025 3053
1 Aaron Halfaker 1.566837 692
84 Peter Sheridan Dodds 1.534204 5482
100 Stanislas Dehaene 1.496988 40
97 Ryota Tomioka 1.421273 4815
102 Stephan Gouws 1.345312 1428
... ... ...

116 rows × 3 columns

In [21]:
# Not all is shown above
# The below code will give the full list:
# https://stackoverflow.com/questions/23388810/ipython-notebook-output-cell-is-truncating-contents-of-my-list
from IPython.display import HTML
HTML(high_score.to_html())
Out[21]:
name K-index Statuses count
104 Sune Lehmann 43.35202 1994
43 Emilio Ferrara 29.38693 4680
61 Jure Leskovec 14.17319 575
111 Uta Frith 13.85914 3760
19 Brian Keegan 13.52433 17571
74 Michael Cook 13.20721 24813
71 Marcus Messner 10.85944 3086
44 Eszter Hargittai 10.48633 3050
29 Daniel Mietchen 10.21248 18336
72 Maria Grineva 10.20603 15734
64 Lada A. Adamic 9.837668 278
98 Sarita Yardi Schoenebeck 8.830271 15314
55 J. Douglas Bremner 7.902007 9638
107 Thomas Steiner 7.387838 6546
49 Gaël Varoquaux 7.323572 1326
39 Ed H. Chi 7.263597 7197
30 Dario Taraborelli 7.039645 4027
106 Tal Yarkoni 6.59077 2586
51 Geraint Rees 6.533561 1246
87 Phoebe Ayers 6.198842 2682
116 Yong-Yeol Ahn 6.043176 8292
17 Benjamin Mako Hill 5.815526 197
80 Noshir S. Contractor 5.768472 899
5 Alex O. Holcombe 5.398742 6329
41 Egon Willighagen 5.352792 31695
89 R. Stuart Geiger 5.329288 3053
8 Amy Bruckman 4.750928 3101
14 Antonia Hamilton 4.601453 1778
63 Katrin Weller 4.519743 3682
37 Dirk Riehle 4.273715 4747
92 Raphaël Troncy 4.229026 5685
25 Cliff Lampe 3.944443 3606
56 Jakob Eg Larsen 3.93527 2468
31 David D. Cox 3.797303 2709
105 Taha Yasseri 3.666922 723
96 Ryota Kanai 3.658165 2230
27 Dan Jurafsky 3.499375 68
15 Arkadiusz Stopczynski 3.370665 4556
95 Russell A. Poldrack 3.091737 551
16 Arto Lanamäki 3.045272 1050
0 Aaron Clauset 2.984011 575
36 Denny Vrandečić 2.847603 1250
45 Felipe Ortega 2.768661 1733
47 Frank G. A. de Bakker 2.726936 4753
11 Andrea Forte 2.69998 1428
83 Paul Groth 2.444785 4791
103 Sue B. Moon 2.39438 3346
7 Alon Halevy 2.314019 328
23 Chris D. Frith 2.212742 107
52 Haewoon Kwak 2.030797 11376
81 Olaf Sporns 1.851942 483
24 Christian Kohlschütter 1.684115 5203
70 Heinrich Magnus Manske 1.650236 2723
34 David M. Pennock 1.610894 834
86 Petter Holme 1.593025 3053
1 Aaron Halfaker 1.566837 692
84 Peter Sheridan Dodds 1.534204 5482
100 Stanislas Dehaene 1.496988 40
97 Ryota Tomioka 1.421273 4815
102 Stephan Gouws 1.345312 1428
79 Nikola Milikic 1.339853 107
109 Tom M. Mitchell 1.313142 21
82 Paolo Massa 1.306788 465
12 Andreas Roepstorff 1.143615 2042
28 Daniel M. Romero 1.042678 339
60 Joseph R Kiniry 1.036185 1480
75 Michael Kai Petersen 1.030835 154
46 Finn Årup Nielsen 0.9939271 2569
91 Radu Drăgușin 0.9096936 30
57 James W. Pennebaker 0.8613386 39
50 Geoffrey Karl Aguirre 0.7240992 357
32 David Laniado 0.6778401 58
3 Alan Ruttenberg 0.6029182 145
48 Friederike Schultz 0.5622809 4
38 Dominik Benz 0.5512532 1151
68 Leon French 0.4934968 162
18 Bjarne Ørum Fruergaard 0.4660093 30
101 Stefania Serafin 0.4615673 351
65 Lars Kai Hansen 0.4436704 1251
42 Elanor Colleoni 0.4111292 31
94 Roderick Murray-Smith 0.380659 554
67 Lasse L. Mølgaard 0.3693654 156
58 Jan Larsen 0.3333603 207
77 Mikkel Wallentin 0.3172575 24
73 Matthew G. Liptrot 0.3059698 506
112 Vedran Sekara 0.2842159 43
85 Peter Vuust 0.2595274 28
33 David M. Blei 0.242567 0
54 J. Andreas Bærentzen 0.2377323 104
93 Renaud Lambiotte 0.2345374 31
110 Trine Julie Abrahamsen 0.222549 9
69 Luca de Alfaro 0.2064739 85
59 Jens Madsen 0.2003703 7
10 Andrea Cuttone 0.1989996 1
13 Andrius Butkus 0.1978109 3
76 Mikkel N. Schmidt 0.1591836 25
62 Kasper Winther Andersen 0.1481508 88
88 Piotr Konieczny 0.1426637 158
53 Henry Markram 0.1372832 0
21 Carsten Stahlhut 0.110743 26
78 Morten Mørup 0.1098451 3
115 Wray Buntine 0.1090059 30
6 Alex Verstak 0.09618637 0
4 Albert Gjedde 0.0933135 162
66 Lars Ramkilde Knudsen 0.08555802 40
22 Chitu Okoli 0.08295347 376
108 Tobias S. Andersen 0.05471306 4
2 Alan Mislove 0.04338656 0
40 Edward T. Bullmore 0.03679437 0
114 Vince Calhoun 0.02110989 0
20 C. Lee Giles 0.02110068 1
113 Vibe Gedsø Frøkjær 0.0203848 8
35 David N. Kennedy 0.01667231 70
90 Rada Mihalcea 0.01475713 0
26 Cyril Goutte 0.004659881 0
9 Anders Stockmarr 0.003353294 0