Google Scholar, OverLeaf, and Gephi
Transcript of Google Scholar, OverLeaf, and Gephi
Google Scholar, OverLeaf, and
Gephi
Prof. Ralucca Gera, Applied Mathematics Dept.Naval Postgraduate SchoolMonterey, [email protected]
Excellence Through Knowledge
Google Scholar (https://scholar.google.com/)•
Google Scholar’s goals
• It enables searches of scholarly literature (scholarly publications, abstracts, books…).
• Easy to use (same familiar search bar as Google, but results are limited to scholarly resources).
• Contains links to the article’s PDF, Postscript, HTML format.
• Easy to view key information about articles: “cited by”, “related articles” and so on.
• Easy to export citation to LaTeX that can be used for any research formatting style.
Historical background
–Google Scholar was released (in beta) in 2004–Not the first freely available citation database
(CiteSeer, Scirus, etc)–Not subject specific
Many use it as a viable alternative to traditional citation databases such as Web of Science, since:
– Easy of use (free, no account needed)– Extensive coverage of articles– Encodes wide range of metrics about
articles/authors– However metrics are not perfect (data base is not
complete, some duplications exist, algorithms 4
Features of Google Scholar
• Searches all scholarly literature from one convenient place
• Explores related works, citations, authors, and publications
• Locates the complete document through your library or on the web
• Keeps up with recent developments in any area of research
• Checks who's citing your publications, creates a public author profile (if you get an account)
5Source: http://scholar.google.com/scholar/about.html#about
How are documents ranked?
“Google Scholar aims to rank documents the way researchers do, • weighing the full text of each document, • where it was published, • who it was written by, • as well as how often and how recently it has
been cited in other scholarly literature.”
6Source: http://scholar.google.com/scholar/about.html#about
H-index of a scientist from wikipedia
• A scholar with an index of h has published hpapers each of which has been cited in other papers at least h times
• The h-index of a publication is the largest number h such that at least h articles in that publication were cited at least h times each.
7Source: https://en.wikipedia.org/wiki/H-index
For example, a publication with five articles cited by 17, 9, 6, 3, and 2 papers respectively, has the h-index of 3.
Source: http://scholar.google.com/intl/en-US/scholar/metrics.html#metrics
Google Scholar Available Metrics
• The h-core of a publication is the set of top cited h articles from the publication. – For example, the publication above has the h-core with three
articles, those cited by 17, 9, and 6 in the list 17, 9, 6, 3, and 2 with the h-index 3.
• The h-median of a publication is the median of the citation counts in its h-core. – For the example before, the h-median of the publication was 9
(recall the references are 17, 9, 6, 3, and 2 times, of which17, 9, 6 are in the k-core).
• The h-median is a measure of the distribution of citations to the articles in the h-core.
8Source: http://scholar.google.com/intl/en-US/scholar/metrics.html#metrics
Google Scholar Available Metrics
• Finally, the h5-index, h5-core, and h5-median of a publication are, respectively, the h-index, h-core, and h-median of the articles that were published in the last 5 complete calendar years.
9Source: http://scholar.google.com/intl/en-US/scholar/metrics.html#metrics
Try Google Scholar now!
Excellence Through Knowledge
ShareLaTeX
• Free: https://www.overleaf.com/• OverLeaf is a web-based real-time collaborative
(like GoogleDocs) LaTeX editor (no need to download LaTeX on your machine)
• Documentation: https://www.overleaf.com/learn• Templates:
https://www.overleaf.com/latex/templates• Use it:www.overleaf.com (use your NPS email).
11
Use your NPS email!
Synch it with the Dropbox and GitHub (the dropbox account must use the same email addressed use to create the overleaf account): 1. Click Account on the
upper right on the main site2. Account settings3. Log in4. Dropbox Integration
12
Gephi Overview(https://gephi.org/)
Resources for Gephi:
• Overview and explanations of Gephi• Gephi’s overview tutorial• An introductory video to create data for Gephi
and to use degree, closeness and betweeness (also posted on the website unde today’s lecture).
• Gephi’s overview of layouts• Basic navigation
Layouts
• Layout choices (install plugins for more):Force directed (repulsion) ones:– Force Atalas 2 (It is focused on being useful to explore and get
meaning for real data, and a good readability, slow)– Yifan Hu (similar to FA2, fast, good for large graphs)– Fruchterman-Reingold (The nodes are the mass particles and the
edges are springs between the particles. The algorithms try to minimize the energy of this physical system. It has become a standard but remains very slow.)
– OpenOrd layout (good for communities)
Not force directed: – Expansion– Geographic map with GeoLayout 15
SAVE
• Once you have a visualization that you like, save the network, so that the next time you open it looks the same
• Cannot use undo in Gephi• When you run an analysis, save the network
each time with a different name • When you open part of a network on a new tab
in Gephi, save that as well.
16Try Gephi Now!
Preview Tab
• Click Preview next to the Data Laboratory, youmight like that view of thenetwork better:
• If you export, then this is what you export:
17
Ranking nodes based on a statistic
• Once you ran a statistic (say degree) => size/color the nodes based on the ran statistic.
• Under on the topleft, choose Nodes andeither size or color
• Depending on the version you run, you will see:
18
Filtering nodes based on a statistic
• Find Filters on the top right, next to Statistics
• Under topology, you can findthe centralities
• Choose one, drag and drop it to the Queries
• Choose the bounds needed.
19
Export
• Take a screen shot using Snipping Tool, or• Export the visualized graph as SVG or PDF:
– Go to preview (fix if needed)– Resize for large networks– Click SVG
(SVG is vectorial graphics like PDF so they scale to different sizes nicely)
20
Other statistics
21
• Average path length: under the statics module, right
• Computes the average ofshortest paths between allpairs of nodes
• Result: