S701 Guest Lecture - Indiana University

Dr. Katy Börner
Cyberinfrastructure for Network Science Center, Director
Information Visualization Laboratory, Director
School of Library and Information Science
Indiana University, Bloomington, IN
katy@indiana.edu

With special thanks to Kevin W. Boyack, Micah Linnemeier, Russell J. Duhon, Patrick Phillips, Joseph Biberstine, Chintan Tank, Nianli Ma, Angela M. Zoss, Hanning Guo, Mark A. Price, Scott Weingart

November 13, 2009

Three Readings

Börner, K., Chen, C., & Boyack, K. (2003). Visualizing knowledge domains. In B. Cronin (Ed.), Annual Review of Information Science and Technology, 37 (pp. 179-255). Medford, NJ: Information Today.

Börner, K., Dall’Asta, L., Ke, W., & Vespignani, A. (2005). Studying the emerging global brain: Analyzing and visualizing the impact of co-authorship teams. Complexity, 10(4), pp. 58-67.

Börner, K., Huang, W., Linnemeier, M., Duhon, R. J., Phillips, P., Ma, N. et al. (2009). Rete-Netzwerk-Red: Analyzing and visualizing scholarly networks using the Scholarly Database and the Network Workbench Tool. In B. Larsen & J. Leta (Eds.), Proceedings of ISSI 2009: 12th International Conference on Scientometrics and Informetrics, Rio de Janeiro, Brazil, July 14-17, 2009, vol. 2 (pp. 619-630). Bireme/PAHO/WHO & the Federal University of Rio de Janeiro.


Process of Computational Scientometrics

Börner, Katy, Chen, Chaomei, and Boyack, Kevin. (2003) Visualizing Knowledge Domains. In Blaise Cronin (Ed.), Annual Review of Information Science & Technology, Volume 37, Medford, NJ: Information Today, Inc./American Society for Information Science and Technology, chapter 5, pp. 179-255.

Needs-Driven Workflow Design using a modular data acquisition/analysis/modeling/visualization pipeline as well as modular visualization layers.

Börner, Katy (2010) Atlas of Science. MIT Press.
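The modular pipeline idea can be illustrated with a small sketch. This is not Sci2 or CIShell code. The stage functions and the `run_pipeline` helper are hypothetical names, chosen only to mirror the acquisition, analysis, and visualization steps named on the slide.

```python
from collections import Counter
from typing import Callable, Iterable, List

# Each stage is a plain function, so stages can be swapped or reordered.
Stage = Callable[[object], object]


def acquire(raw_lines: Iterable[str]) -> List[str]:
    """Data acquisition: keep non-empty records, stripped of whitespace."""
    return [line.strip() for line in raw_lines if line.strip()]


def analyze(records: List[str]) -> Counter:
    """Analysis: count how often each record value occurs."""
    return Counter(records)


def visualize(counts: Counter) -> str:
    """Visualization: render a text bar chart, most frequent value first."""
    rows = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return "\n".join(f"{name:<10} {'#' * n}" for name, n in rows)


def run_pipeline(data: object, stages: List[Stage]) -> object:
    """Feed the output of each stage into the next one."""
    for stage in stages:
        data = stage(data)
    return data


chart = run_pipeline(["NSF", "NIH", "", "NSF "], [acquire, analyze, visualize])
print(chart)
```

Because every stage has the same shape (one input, one output), a different analysis or visualization step can be dropped in without touching the others, which is the point of a modular design.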


CI for a Science of Science Studies

Scholarly Database: 23 million scholarly records
http://sdb.slis.indiana.edu

Information Visualization Cyberinfrastructure
http://iv.slis.indiana.edu

Network Workbench Tool + Community Wiki
http://nwb.slis.indiana.edu

Sci2 Tool and Science of Science CI Portal
http://sci.slis.indiana.edu

Epidemics Cyberinfrastructure
http://epic.slis.indiana.edu/

Sci2 Tool
http://sci.slis.indiana.edu

“Open Code for S&T Assessment”
Branded OSGi/CIShell based tool with NWB plugins and many new plugins.

GUESS Network Vis

Horizontal Time Graphs

Sci Maps

Börner, Katy, Huang, Weixia (Bonnie), Linnemeier, Micah, Duhon, Russell Jackson, Phillips, Patrick, Ma, Nianli, Zoss, Angela, Guo, Hanning & Price, Mark. (2009). Rete-Netzwerk-Red: Analyzing and Visualizing Scholarly Networks Using the Scholarly Database and the Network Workbench Tool. Proceedings of ISSI 2009: 12th International Conference on Scientometrics and Informetrics, Rio de Janeiro, Brazil, July 14-17. Vol. 2, pp. 619-630.


Sci2 Tool

Geo Maps

Circular Hierarchy

Sci2 Tool: Supported Data Formats

Personal Bibliographies
• Bibtex (.bib)
• Endnote Export Format (.enw)

Data Providers
• Web of Science by Thomson Scientific/Reuters (.isi)
• Scopus by Elsevier (.scopus)
• Google Scholar (access via Publish or Perish save as CSV, Bibtex, EndNote)
• Awards Search by National Science Foundation (.nsf)

Scholarly Database (all text files are saved as .csv)
• Medline publications by National Library of Medicine
• NIH funding awards by the National Institutes of Health (NIH)
• NSF funding awards by the National Science Foundation (NSF)
• U.S. patents by the United States Patent and Trademark Office (USPTO)
• Medline papers – NIH Funding

Network Formats
• NWB (.nwb)
• Pajek (.net)
• GraphML (.xml or .graphml)
• XGMML (.xml)

Burst Analysis Format
• Burst (.burst)

Other Formats
• CSV (.csv)
• Edgelist (.edge)
• Pajek (.mat)
• TreeML (.xml)


Sci2 Tool: Algorithms
See https://nwb.slis.indiana.edu/community

Preprocessing
• Extract Top N% Records, Extract Top N Records, Normalize Text, Slice Table by Line
• Extract Top Nodes, Extract Nodes Above or Below Value, Delete Isolates
• Extract top Edges, Extract Edges Above or Below Value, Remove Self Loops, Trim by Degree, MST-Pathfinder Network Scaling, Fast Pathfinder Network Scaling
• Snowball Sampling (in nodes), Node Sampling, Edge Sampling
• Symmetrize, Dichotomize, Multipartite Joining
• Geocoder
• Extract ZIP Code

Modeling
• Random Graph, Watts-Strogatz Small World, Barabási-Albert Scale-Free, TARL

Analysis
• Network Analysis Toolkit (NAT)

Analysis: Unweighted & Undirected
• Node Degree, Degree Distribution
• K-Nearest Neighbor (Java), Watts-Strogatz Clustering Coefficient, Watts Strogatz Clustering Coefficient over K
• Diameter, Average Shortest Path, Shortest Path Distribution, Node Betweenness Centrality
• Weak Component Clustering, Global Connected Components
• Extract K-Core, Annotate K-Coreness
• HITS

Analysis: Weighted & Undirected
• Clustering Coefficient, Nearest Neighbor Degree, Strength vs Degree, Degree & Strength, Average Weight vs End-point Degree, Strength Distribution, Weight Distribution, Randomize Weights
• Blondel Community Detection
• HITS

Analysis: Unweighted & Directed
• Node Indegree, Node Outdegree, Indegree Distribution, Outdegree Distribution
• K-Nearest Neighbor, Single Node in-Out Degree Correlations
• Dyad Reciprocity, Arc Reciprocity, Adjacency Transitivity
• Weak Component Clustering, Strong Component Clustering

Sci2 Tool: Algorithms cont.
See https://nwb.slis.indiana.edu/community

Analysis: Unweighted & Directed (cont.)
• Extract K-Core, Annotate K-Coreness
• HITS, PageRank

Analysis: Weighted & Directed
• HITS, Weighted PageRank

Analysis: Textual
• Burst Detection

Visualization
• GnuPlot, GUESS, Image Viewer
• Radial Tree/Graph (prefuse alpha), Radial Tree/Graph with Annotation (prefuse beta), Tree View (prefuse beta), Tree Map (prefuse beta), Force Directed with Annotation (prefuse beta), Fruchterman-Reingold with Annotation (prefuse beta)
• DrL (VxOrd), Specified (prefuse beta)
• Horizontal Line Graph, Circular Hierarchy, Geo Map (Circle Annotation Style), Geo Map (Colored-Region Annotation Style)

Scientometrics
• Remove ISI Duplicate Records, Remove Rows with Multitudinous Fields, Detect Duplicate Nodes, Update Network by Merging Nodes
• Extract Directed Network, Extract Paper Citation Network, Extract Author Paper Network
• Extract Co-Occurrence Network, Extract Word Co-Occurrence Network, Extract Co-Author Network, Extract Reference Co-Occurrence (Bibliographic Coupling) Network
• Extract Document Co-Citation Network


NWB=Sci2 Tool: Output Formats

NWB tool can be used for data conversion. Supported output formats comprise:
• CSV (.csv)
• NWB (.nwb)
• Pajek (.net)
• Pajek (.mat)
• GraphML (.xml or .graphml)
• XGMML (.xml)

GUESS supports export of images into common image file formats.

Horizontal Bar Graphs saves out raster and ps files.
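To make the idea of format conversion concrete, here is a minimal sketch that writes an edge list in the plain-text Pajek (.net) layout: a `*Vertices` section with numbered, quoted labels followed by an `*Edges` (or `*Arcs`) section. It is hand-written for illustration and is not the converter shipped with NWB or Sci2. The function name `to_pajek_net` and the sample names are made up.

```python
def to_pajek_net(edges, directed=False):
    """Serialize (source, target, weight) triples as Pajek .net text."""
    # Pajek refers to vertices by 1-based integer ids.
    labels = sorted({node for s, t, _ in edges for node in (s, t)})
    ids = {label: i for i, label in enumerate(labels, start=1)}

    lines = [f"*Vertices {len(labels)}"]
    lines += [f'{ids[label]} "{label}"' for label in labels]
    # Directed links go in an *Arcs section, undirected ones in *Edges.
    lines.append("*Arcs" if directed else "*Edges")
    lines += [f"{ids[s]} {ids[t]} {w}" for s, t, w in edges]
    return "\n".join(lines) + "\n"


sample_edges = [("Ann", "Bob", 2), ("Bob", "Cy", 1)]
net_text = to_pajek_net(sample_edges)
print(net_text)
```

The same triples could be written as CSV or an edge list with a different serializer, which is all a conversion step between these formats has to do.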

Computational Scientometrics: Studying Science by Scientific Means

Börner, Katy, Chen, Chaomei, and Boyack, Kevin. (2003). Visualizing Knowledge Domains. In Blaise Cronin (Ed.), ARIST, Medford, NJ: Information Today, Inc./American Society for Information Science and Technology, Volume 37, Chapter 5, pp. 179-255. http://ivl.slis.indiana.edu/km/pub/2003-borner-arist.pdf

Shiffrin, Richard M. and Börner, Katy (Eds.) (2004). Mapping Knowledge Domains. Proceedings of the National Academy of Sciences of the United States of America, 101(Suppl_1). http://www.pnas.org/content/vol101/suppl_1/

Börner, Katy, Sanyal, Soma and Vespignani, Alessandro (2007). Network Science. In Blaise Cronin (Ed.), ARIST, Information Today, Inc./American Society for Information Science and Technology, Medford, NJ, Volume 41, Chapter 12, pp. 537-607. http://ivl.slis.indiana.edu/km/pub/2007-borner-arist.pdf

Börner, Katy (2010) Atlas of Science. MIT Press. http://scimaps.org/atlas


Type of Analysis vs. Scale of Level of Analysis

Levels of analysis:
• Micro/Individual (1-100 records)
• Meso/Local (101–10,000 records)
• Macro/Global (10,000 < records)

Statistical Analysis/Profiling
• Micro/Individual: Individual person and their expertise profiles
• Meso/Local: Larger labs, centers, universities, research domains, or states
• Macro/Global: All of NSF, all of USA, all of science.

Temporal Analysis (When)
• Micro/Individual: Funding portfolio of one individual
• Meso/Local: Mapping topic bursts in 20 years of PNAS
• Macro/Global: 113 Years of Physics Research

Geospatial Analysis (Where)
• Micro/Individual: Career trajectory of one individual
• Meso/Local: Mapping a state's intellectual landscape
• Macro/Global: PNAS publications

Topical Analysis (What)
• Micro/Individual: Base knowledge from which one grant draws.
• Meso/Local: Knowledge flows in Chemistry research
• Macro/Global: VxOrd/Topic maps of NIH funding

Network Analysis (With Whom?)
• Micro/Individual: NSF Co-PI network of one individual
• Meso/Local: Co-author network
• Macro/Global: NSF’s core competency
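The record-count thresholds that separate the three levels can be written down as a tiny helper. This is only a sketch of the cut-offs stated on the slide (1-100, 101-10,000, more than 10,000 records). The function name `level_of_analysis` is hypothetical.

```python
def level_of_analysis(n_records: int) -> str:
    """Map a record count to the micro/meso/macro level used on the slide."""
    if n_records < 1:
        raise ValueError("a study needs at least one record")
    if n_records <= 100:
        return "micro"   # individual level
    if n_records <= 10_000:
        return "meso"    # local level
    return "macro"       # global level


print(level_of_analysis(1340))
```

Under these thresholds a data set of 1,340 awards, such as the NSF sample study later in the lecture, counts as a meso-level study.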

Type of Analysis vs. Scale of Level of Analysis

Levels of analysis:
• Micro/Individual (1-100 records)
• Meso/Local (101–10,000 records)
• Macro/Global (10,000 < records)

Statistical Analysis/Profiling
• Micro/Individual: Individual person and their expertise profiles
• Meso/Local: Larger labs, centers, universities, research domains, or states
• Macro/Global: All of NSF, all of USA, all of science.

Temporal Analysis (When)
• Micro/Individual: Funding portfolio of one individual
• Meso/Local: Mapping topic bursts in 20 years of PNAS
• Macro/Global: 113 Years of Physics Research

Geospatial Analysis (Where)
• Micro/Individual: Career trajectory of one individual
• Meso/Local: Mapping a state's intellectual landscape
• Macro/Global: PNAS publications

Topical Analysis (What)
• Micro/Individual: Base knowledge from which one grant draws.
• Meso/Local: Knowledge flows in Chemistry research
• Macro/Global: VxOrd/Topic maps of NIH funding

Network Analysis (With Whom?)
• Micro/Individual: NSF Co-PI network of one individual
• Meso/Local: Co-author network
• Macro/Global: NIH’s core competency


Mapping the Evolution of Co-Authorship Networks
Ke, Visvanath & Börner (2004). Won 1st prize at the IEEE InfoVis Contest.

Data: Available as mdb from http://iv.slis.indiana.edu/ref/iv04contest

Algorithms/Tools: The complete workflow with pointers to code is at http://iv.slis.indiana.edu/ref/iv04contest


Studying the Emerging Global Brain: Analyzing and Visualizing the Impact of Co-Authorship Teams
Börner, Dall’Asta, Ke & Vespignani (2005) Complexity, 10(4):58-67.

Research question:

• Is science driven by prolific single experts or by high-impact co-authorship teams?

Contributions:

• New approach to allocate citational credit.

• Novel weighted graph representation.

• Visualization of the growth of weighted co-author network.

• Centrality measures to identify author impact.

• Global statistical analysis of paper production and citations in correlation with co-authorship team size over time.

• Local, author-centered entropy measure.

Data: Available as mdb from http://iv.slis.indiana.edu/ref/iv04contest

Algorithms/Tools: Custom DB queries and code, not available.

http://sci.slis.indiana.edu/sts


(1) Science/Economy/STEM is Global

Illuminated Diagram Display

W. Bradford Paley, Kevin W. Boyack, Richard Klavans, and Katy Börner (2007) Mapping, Illuminating, and Interacting with Science. SIGGRAPH 2007, San Diego, CA.


Science Puzzle Map for Kids by Fileve Palmer, Julie Smith, Elisha Hardy and Katy Börner, Indiana University, 2006. (Base map taken from Illuminated Diagram display by Kevin Boyack, Richard Klavans, and W. Bradford Paley.)


Mapping Science Exhibit – 10 Iterations in 10 years
http://scimaps.org

• The Power of Maps (2005)
• The Power of Reference Systems (2006)
• The Power of Forecasts (2007)
• Science Maps for Economic Decision Makers (2008)
• Science Maps for Science Policy Makers (2009)
• Science Maps for Scholars (2010)
• Science Maps as Visual Interfaces to Digital Libraries (2011)
• Science Maps for Kids (2012)
• Science Forecasts (2013)
• Telling Lies With Science Maps (2014)

Exhibit has been shown in 72 venues on four continents. Currently at
- NSF, 10th Floor, 4201 Wilson Boulevard, Arlington, VA
- Wallenberg Hall, Stanford University, CA
- Center of Advanced European Studies and Research, Bonn, Germany
- Science Train, Germany.


Debut of 5th Iteration of Mapping Science Exhibit at MEDIA X was on May 18, 2009 at Wallenberg Hall, Stanford University, http://mediax.stanford.edu, http://scaleindependentthought.typepad.com/photos/scimaps

(2) STEM is Evolving Dynamically

Self-amplifying downward spiral | ‘systemic’ meltdown with intertwined breakdowns | ‘war room’ analyses | market wind tunnel | power market test bed | Regulators feel duty-bound to adhere to generally accepted and well-vetted techniques

“… while any new technical device or medical drug has extensive testing for efficiency, reliability and safety before it ever hits the market, we still implement new economic measures without any prior testing.” Dirk Helbing


Monitor and Analyze/Visualize STEM in Real Time

Design a ‘STEM Wind Tunnel’ or ‘STEM Knowledge Collider’

That empowers anybody to see what new
• Research results
• Policy decisions
• Teaching material
• Jobs
exist

Together with
• Bursts of activity
• Evolving communities of research/practice
• Positive/negative feedback cycles

Ideally, what-if scenarios could be modeled.

Interactive Maps of Science – NIH Funding
Google maps with charts and tables

http://scimaps.org/maps/nih/2007


A Clickstream Map of Science – Bollen, Johan, Herbert Van de Sompel, Aric Hagberg, Luis M.A. Bettencourt, Ryan Chute, Marko A. Rodriguez, Lyudmila Balakireva - 2008

Interactive Maps of Science – Philanthropy

http://www.philanthropyinsight.org


Council for Chemical Research - Chemical R&D Powers the U.S. Innovation Engine. Washington, DC. Courtesy of the Council for Chemical Research - 2009


Mapping S&T Job Market Data in Real Time – GeoMap
Angela Zoss, Michael Conover

Data
Thousands of full-text, location-specific, time-stamped job postings from Nature Jobs and Science Careers RSS feeds. The posts have been parsed and stored in a relational MySQL database.

Jobs have been geolocated on a Google map.
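The parse-and-store step can be sketched as follows. This is not the project's code: the feed text is invented, the `jobs` table layout is an assumption, and SQLite from the Python standard library stands in for the MySQL database so that the example runs without a server.

```python
import sqlite3
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime

# Made-up feed content; a real feed would be fetched over HTTP.
SAMPLE_RSS = """<rss version="2.0"><channel>
  <title>Example job feed</title>
  <item>
    <title>Postdoc in Network Science</title>
    <link>http://example.org/jobs/1</link>
    <pubDate>Fri, 13 Nov 2009 09:00:00 GMT</pubDate>
    <description>Bloomington, IN. Analyze scholarly networks.</description>
  </item>
  <item>
    <title>Research Programmer</title>
    <link>http://example.org/jobs/2</link>
    <pubDate>Thu, 12 Nov 2009 15:30:00 GMT</pubDate>
    <description>Arlington, VA. Build visualization tools.</description>
  </item>
</channel></rss>"""


def parse_items(rss_text):
    """Yield (link, title, posted, description) for every RSS item."""
    root = ET.fromstring(rss_text)
    for item in root.iter("item"):
        # RSS dates use the RFC 822 style; store them as ISO 8601 text.
        posted = parsedate_to_datetime(item.findtext("pubDate"))
        yield (
            item.findtext("link", default=""),
            item.findtext("title", default=""),
            posted.isoformat(),
            item.findtext("description", default=""),
        )


def store_items(conn, items):
    """Insert postings, using the link as key so re-polling adds no duplicates."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS jobs ("
        "link TEXT PRIMARY KEY, title TEXT, posted TEXT, description TEXT)"
    )
    conn.executemany("INSERT OR IGNORE INTO jobs VALUES (?, ?, ?, ?)", items)
    conn.commit()


conn = sqlite3.connect(":memory:")
store_items(conn, parse_items(SAMPLE_RSS))
store_items(conn, parse_items(SAMPLE_RSS))  # polling the feed a second time
job_count = conn.execute("SELECT COUNT(*) FROM jobs").fetchone()[0]
print(job_count)
```

Keying on the posting link is one simple way to keep a periodically polled feed from filling the table with repeats. Geolocating the postings would be a separate step that is not shown here.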


Mapping S&T Job Market Data in Real Time – SciMap
Angela Zoss, Michael Conover

The UCSD Map of Science used here is the product of a large study by researchers at the University of California - San Diego using 7.2 million papers and over 16,000 separate journals, proceedings, and series from Thomson Scientific and Scopus over the five year period from 2001 to 2005. Jobs were associated with nodes in the Map of Science by way of keyword extraction.


(3) Open Data and Open Code
Studying Individual, Local, and Global STEM Flows and Activity Patterns

Design comprehensive databases that capture relevant data and cyberinfrastructures that can be used to make sense of this data (stream).

STEM studies can be conducted at different levels:
• micro (individual),
• meso (local, e.g., one institute, one funding agency), or
• global level (all of science or world wide).

Using
• Statistical Analysis/Profiling
• Temporal Analysis (When)
• Geospatial Analysis (Where)
• Topical Analysis (What)
• Network Analysis (With Whom?)


Sample Study – NSF Funding of STEM

Using NSF Awards Search:
http://www.nsf.gov/awardsearch
download relevant NSF awards that have “stem” and “education” in title, abstract, and awards.
Active awards only.

Number of awards: 1,340
Total awarded amount to date: $1,347,802,833

Retrieved on Oct 18, 2009
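The query itself was run on the NSF Award Search web site. As a local analogue, the sketch below filters a downloaded table for records that mention both terms and sums the award amounts. The three sample rows and the `Abstract` column name are invented for illustration, and matching on whole words (so that "system" does not count as "stem") is an assumption about the intended search, not documented behavior of the NSF site.

```python
import csv
import io
import re

# Invented sample rows; real data would come from an Award Search download.
SAMPLE_CSV = """Title,Abstract,Awarded Amount to Date
STEM Education Pathways,Improves STEM education in rural schools.,"$1,200,000"
Ecosystem Modeling,Studies river systems and education outreach.,"$300,000"
Teacher Training,Education research for STEM teachers.,"$450,500"
"""


def mentions(text, word):
    """True if `word` occurs in `text` as a whole word, ignoring case."""
    return re.search(rf"\b{re.escape(word)}\b", text, re.IGNORECASE) is not None


def select_awards(rows, terms=("stem", "education")):
    """Keep rows whose title or abstract mentions every search term."""
    selected = []
    for row in rows:
        text = f"{row['Title']} {row['Abstract']}"
        if all(mentions(text, term) for term in terms):
            selected.append(row)
    return selected


def parse_amount(amount_text):
    """Turn a string such as '$1,200,000' into an integer number of dollars."""
    return int(amount_text.replace("$", "").replace(",", ""))


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
awards = select_awards(rows)
total = sum(parse_amount(r["Awarded Amount to Date"]) for r in awards)
print(len(awards), total)
```

On the sample rows the "Ecosystem Modeling" record is dropped because "stem" only appears inside longer words.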

Federal K-12 STEM Education Program Funding in 2006

SOURCE: Department of Education, Report of the Academic Competitiveness Council, 2007

Search for awards that have “stem” and “education” in title, abstract, and awards. Active awards only. Query run on 10/18/2009.


Top-10 Projects with highest Award Amount to Date

1,340 Funded Projects – Horizontal Bar Graphs

Area size equals numerical value, e.g., award amount.
Bar legend: Start date, End date, Text (e.g., title)

Horizontal Line Graph was selected.
Input Parameters:
Start Date: Start Date
Size By: Awarded Amount to Date
Label: Title
End Date: Expiration Date
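If a bar's width spans the project's start and end dates and its area encodes the award amount, the bar height follows from dividing the amount by the width. The sketch below works through that arithmetic. The slide does not say how Sci2 scales the bars internally, so the `px_per_day` parameter and the function name `bar_geometry` are assumptions made for illustration.

```python
from datetime import date


def bar_geometry(start, end, amount, px_per_day=1.0):
    """Return (width, height) so that width * height equals the amount."""
    days = (end - start).days
    if days <= 0:
        raise ValueError("end date must be after start date")
    width = days * px_per_day   # horizontal extent encodes duration
    height = amount / width     # height is chosen so the area encodes the amount
    return width, height


width, height = bar_geometry(date(2009, 1, 1), date(2009, 4, 11), 5000)
print(width, height, width * height)
```

One consequence of this encoding is that a short, well-funded project shows up as a tall narrow bar, while a long project with the same budget is wide and flat.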


1,340 Funded Projects – Geographic Maps

Geocoder was selected.
Input Parameters:
Place Name Column: Organization State
Place Type: STATE
..........
Geo Map (Circle Annotation Style) was selected.
Input Parameters:
Longitude: Longitude
Size Circles By: Awarded Amount to Date
Color Circle Exteriors By: Awarded Amount to Date
Color Circle Interiors By: None (no inner color)
Exterior Color Scaling: Linear
Exterior Color Range: Green to Red
Interior Color Range: Green to Red
Size Scaling: Linear
Projection: Albers Equal-Area Conic
Map: US States
Author Name: K. Borner
Interior Color Scaling: Linear
Latitude: Latitude

What Co-PI Networks Exist?

Extract Directed Network was selected.
Input Parameters:
Source Column: Principal Investigator
Text Delimiter: |
Target Column: Co-PI Name(s)
..........
Network Analysis Toolkit (NAT) was selected.
Nodes: 3225
Isolated nodes: 276
Edges: 2265
Average total degree: 1.4047
Average in degree: 0.7023
Average out degree: 0.7023
..........
Delete Isolates was selected.
..........
Node Degree was selected.
..........
Weak Component Clustering was selected.
Number of top clusters: 10
722 clusters found, generating graphs for the top 10 clusters.

Giant component has 39 nodes.
Next largest networks have 35, 17, 16 nodes.
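The two steps in this log, extracting a directed network from two table columns and summarizing its degrees, can be approximated in a few lines. This is a sketch and not the Sci2 implementation: it assumes that each distinct (source, target) pair becomes one unweighted edge, and the sample rows are invented. The degree formulas are the standard ones for directed graphs and reproduce the averages in the log.

```python
def extract_directed_network(rows, source_col, target_col, delimiter="|"):
    """Build node and edge sets from two columns of delimiter-separated names."""
    nodes, edges = set(), set()
    for row in rows:
        sources = [s.strip() for s in row.get(source_col, "").split(delimiter) if s.strip()]
        targets = [t.strip() for t in row.get(target_col, "").split(delimiter) if t.strip()]
        nodes.update(sources)
        nodes.update(targets)
        # Every source points to every target listed in the same row.
        edges.update((s, t) for s in sources for t in targets)
    return nodes, edges


def average_degrees(n_nodes, n_edges):
    """Average in, out, and total degree of a directed graph."""
    per_node = n_edges / n_nodes  # each edge adds one in-degree and one out-degree
    return {"in": per_node, "out": per_node, "total": 2 * per_node}


sample_rows = [
    {"Principal Investigator": "Ann", "Co-PI Name(s)": "Bob|Cy"},
    {"Principal Investigator": "Dee", "Co-PI Name(s)": ""},  # no Co-PI: isolated node
    {"Principal Investigator": "Ann", "Co-PI Name(s)": "Bob"},
]
nodes, edges = extract_directed_network(
    sample_rows, "Principal Investigator", "Co-PI Name(s)"
)
logged = average_degrees(3225, 2265)  # node and edge counts from the log
print(len(nodes), len(edges), round(logged["total"], 4), round(logged["in"], 4))
```

A PI with no Co-PI, like "Dee" in the sample, ends up as an isolated node, which is why the workflow runs Delete Isolates before looking for components.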


Co-PI Networks – Giant Component

Nodes = investigators
Size and color coded by number of collaborators (degree)

Directed edges from PI to Co-PI

What Projects Fund Which PIs?

Extract Directed Network was selected.
Input Parameters:
Source Column: Title
Text Delimiter: |
Target Column: Principal Investigator
..........
Network Analysis Toolkit (NAT) was selected.
Nodes: 2478
Isolated nodes: 0
Edges: 1337
Average total degree: 1.0791
Average in degree: 0.5395
Average out degree: 0.5395
This graph is not weakly connected.
There are 1144 weakly connected components. (0 isolates)
The largest connected component consists of 14 nodes.
Density (disregarding weights): 0.0002
..........
Node Indegree was selected.
..........
Node Outdegree was selected.
..........
GUESS
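Weakly connected components are found by ignoring edge direction and collecting everything reachable from each unvisited node. The sketch below does this with a breadth-first search and also computes directed density as edges divided by N(N-1), a formula that is consistent with the density values in these logs. It illustrates the technique only. The function names and the toy graph are made up, and the Sci2 plugin may be implemented differently.

```python
from collections import defaultdict, deque


def weak_components(nodes, edges):
    """Return weakly connected components as sets, largest first."""
    neighbors = defaultdict(set)
    for source, target in edges:
        # Ignore direction: link both ways.
        neighbors[source].add(target)
        neighbors[target].add(source)

    seen, components = set(), []
    for start in nodes:
        if start in seen:
            continue
        seen.add(start)
        component, queue = {start}, deque([start])
        while queue:
            current = queue.popleft()
            for other in neighbors[current]:
                if other not in seen:
                    seen.add(other)
                    component.add(other)
                    queue.append(other)
        components.append(component)
    return sorted(components, key=len, reverse=True)


def directed_density(n_nodes, n_edges):
    """Share of the N * (N - 1) possible directed edges that are present."""
    return n_edges / (n_nodes * (n_nodes - 1))


toy_nodes = ["a", "b", "c", "d", "e", "f"]
toy_edges = [("a", "b"), ("c", "b"), ("d", "e")]
component_sizes = [len(c) for c in weak_components(toy_nodes, toy_edges)]
print(component_sizes, round(directed_density(2478, 1337), 4))
```

In the toy graph, "c" joins the component of "a" and "b" even though no directed path leads from "a" to "c", which is exactly what "weak" connectivity means.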


What Projects Fund Which PIs - Zoom

What Programs at NSF are Co-Funding STEM?

Extract Co-Occurrence Network was selected.
Input Parameters:
Text Delimiter: |
Column Name: Program(s)
..........
Node Degree was selected.
..........
Network Analysis Toolkit (NAT) was selected.
Nodes: 226
Isolated nodes: 71
Edges: 483
No self loops were discovered.
Average degree: 4.2743
Density (disregarding weights): 0.019
..........
GUESS
..........
Weak Component Clustering was selected.
79 clusters found
..........
Network Analysis Toolkit (NAT) was selected.
Nodes: 135
Isolated nodes: 0
Edges: 467
No self loops were discovered.
Average degree: 6.9185
Density (disregarding weights): 0.0516
..........
GUESS
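A co-occurrence network links every pair of values that appear together in the same cell of a column. The sketch below builds such a network from delimiter-separated values and computes the undirected average degree (2E/N) and density (2E/(N(N-1))), which reproduce the figures in the log. The program labels "A" to "D" are placeholders, and counting co-occurrences as edge weights is an assumption about what a tool like this would store.

```python
from collections import Counter
from itertools import combinations


def extract_cooccurrence_network(rows, column, delimiter="|"):
    """Return (nodes, weights), where weights counts how often a pair co-occurs."""
    nodes, weights = set(), Counter()
    for row in rows:
        values = sorted({v.strip() for v in row.get(column, "").split(delimiter) if v.strip()})
        nodes.update(values)
        # Sorting first gives each undirected pair one canonical (a, b) key.
        for a, b in combinations(values, 2):
            weights[(a, b)] += 1
    return nodes, weights


def undirected_stats(n_nodes, n_edges):
    """Average degree and density of an undirected graph without self loops."""
    average_degree = 2 * n_edges / n_nodes
    density = 2 * n_edges / (n_nodes * (n_nodes - 1))
    return average_degree, density


sample_rows = [
    {"Program(s)": "A|B"},
    {"Program(s)": "B|A|C"},
    {"Program(s)": "D"},  # a program that never co-funds: isolated node
]
program_nodes, pair_weights = extract_cooccurrence_network(sample_rows, "Program(s)")
avg_degree, density = undirected_stats(226, 483)  # counts from the first NAT run
print(sorted(program_nodes), dict(pair_weights), round(avg_degree, 4), round(density, 3))
```

Programs that only ever fund awards on their own become isolates, which matches the 71 isolated nodes reported before Weak Component Clustering is applied.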


What Programs at NSF are Co-Funding STEM – Giant Component

What Organizations are funded by what NSF Programs?

Extract Directed Network was selected.
Input Parameters:
Source Column: Organization
Text Delimiter: |
Target Column: Program(s)
..........
Network Analysis Toolkit (NAT) was selected.
Nodes: 794
Isolated nodes: 1
Edges: 1592
Average total degree: 4.0101
Average in degree: 2.005
Average out degree: 2.005
The largest connected component consists of 777 nodes.
Density (disregarding weights): 0.0025
..........
Node Indegree was selected.
..........
Node Outdegree was selected.
..........
GUESS

Legend: Organization, NSF Program


What Organizations are funded by what NSF Programs?

Color and size coding by number of NSF programs that fund these organizations.
Institutions which are funded by 10 or more programs are labeled.

Legend: Organization, NSF Program

What NSF Programs fund how many Organizations?

Color and size coding by number of organizations that are funded by these programs.
NSF programs which fund 10 or more organizations are labeled.

Legend: Organization, NSF Program


http://sci.slis.indiana.edu

All papers, maps, cyberinfrastructures, talks, press are linked from http://cns.slis.indiana.edu