Tutorial session 2 Network annotation

25
Tutorial session 2 Network annotation Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar

description

Tutorial session 2 Network annotation. Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar. Overview. Focus: Network annotation and visualization Loading and manipulating attributes Identifier mapping Mapping data onto the network - PowerPoint PPT Presentation

Transcript of Tutorial session 2 Network annotation

Page 1: Tutorial session 2 Network annotation

Tutorial session 2Network annotation

Exploring PPI networks using Cytoscape

EMBO Practical Course Session 8

Nadezhda Doncheva and Piet Molenaar

Page 2: Tutorial session 2 Network annotation

Overview

Focus: Network annotation and visualization Loading and manipulating attributes

Identifier mapping

Mapping data onto the network

Use visuals to convey data

Concepts Vizmapper

Data Human Neuroblastoma mutated genes list

04/21/232

Page 3: Tutorial session 2 Network annotation

Attributes Nodes and edges can

have attributes associated with them Gene expression data Mass spectrometry data Protein structure

information Gene Ontology terms,

etc.

Cytoscape supports multiple data types: Numbers, Text, Logical, Lists...

04/21/233

Page 4: Tutorial session 2 Network annotation

Loading attributes Use pre-formatted attribute files Import attribute from table

Excel file

Comma or tab delimited text

Import attribute from web services NCBI Entrez Gene

Ensembl Biomart

Use ‘import attribute or expression matrix’ Create attributes manually in the attribute

browser04/21/234

Page 5: Tutorial session 2 Network annotation

Loading attributes from table (Demo)

04/21/235

Page 6: Tutorial session 2 Network annotation

Use Case 2.1: Neuroblastoma Childhood neuro endocrine tumor Young children Variable clinical outcome

Low stages Good prognosis

Numeric changes of chromosomal copy numbers

High stages Poor prognosis

Structural chromosomal defects (LOH1p / 11q etc)

Few gene defects identified MYCN amplification (20%) ALK activation (7%) CCND1 / PHOX2B / NF1

Page 7: Tutorial session 2 Network annotation

Use Case 2.1: Neuroblastoma Poor prognosis Subgroup (~1/3) characterized by MYCN

amplification Rest unknown

Page 8: Tutorial session 2 Network annotation

Use case: Assignment 2.1 Whole genome sequence of 86 tumor vs blood 1043 genes with mutations

1.Load the list of genes (neuroblastoma_mutated_symbols.txt) as a network

2.Use the tab separated dataset (neuroblastoma_mutated_annotations.txt) to map additional information

1. Make sure the attributes have informative names

http://cytoscape.org/manual/Cytoscape2_8Manual.html#Import Attribute Table Files

04/21/238

Page 9: Tutorial session 2 Network annotation

Attribute management

04/21/239

Select attributes for display

Specific Attribute Tabs: for Nodes, Edges, and NetworkNode or Edge ID

Different type of attributes: Strings, Numbers, …

Page 10: Tutorial session 2 Network annotation

Tips & Tricks: Root Graph and sessions ”There is one graph to rule them all...” The networks in Cytoscape are all ”views” on

a single graph. Changing the attribute for a node in one

network will also change that attribute for a node with the same ID in all other loaded networks

There is no way to ”copy” a node and keep the same ID

Make a copy of the session

04/21/2310

Page 11: Tutorial session 2 Network annotation

Identifier mapping Identifiers (IDs) are ideally unique, stable name or

numbers But: too many IDs and different database records for

Gene, DNA, RNA, Protein The ID Mapping challenge:

Avoid errors by mapping IDs correctly Gene names are ambiguous Excel introduces errors Problems reaching 100 % coverage

Recommendations (for proteins and genes): Map everything to Entrez Gene IDs using a spreadsheet Manually curate missing mappings to achieve 100 %

coverage Be careful of Excel auto conversions

04/21/2311

Page 12: Tutorial session 2 Network annotation

Identifier mapping (Demo)

04/21/2312

Page 13: Tutorial session 2 Network annotation

Use case: Assignment 2.21. Use the Biomart plugin to map UniProt

identifiers on the genes

http://cytoscape.org/manual/Cytoscape2_8Manual.html#Node Name Mapping

04/21/2313

Page 14: Tutorial session 2 Network annotation

Data mapping

Mapping of data values associated with graph elements onto graph visuals

04/21/2314

Page 15: Tutorial session 2 Network annotation

Data mapping Visual attributes

Node fill color, border color, border width, size, shape, opacity, label

Edge type, color, width, ending type, ending size, ending color

Mapping types Passthrough (labels)

Continuous (numeric values)

Discrete (categories)

Visual style

04/21/2315

Page 16: Tutorial session 2 Network annotation

VizMapper

04/21/2316

Default Visual Style editor

List of Data attributes

List of Visual Styles

List of Visual attributes

Mapping definition

Page 17: Tutorial session 2 Network annotation

Data mapping (Demo)

04/21/2317

Page 18: Tutorial session 2 Network annotation

Tips & Tricks: Data mapping

Avoid cluttering your visualization with too much data

Map the data you are specifically interested in to call out meaningful differences

Mapping too much data to visual attributes may just confuse the viewer

Create multiple networks and map different values

04/21/2318

Page 19: Tutorial session 2 Network annotation

Use case: Assignment 2.31. Map the size of the nodes to the number of

occurrences

2. Map color to the tumor ids

1. Hint: use a rainbow pattern

http://cytoscape.org/manual/Cytoscape2_8Manual.html#Visual Styles

04/21/2319

Page 20: Tutorial session 2 Network annotation

Exploring expression data VistaClara plugin

Exploratory data analysis of multi-experiment microarray studies

A graphical and interactive alternative to the standard attribute browser

04/21/2320

Page 21: Tutorial session 2 Network annotation

VistaClara (Demo)

04/21/2321

Page 22: Tutorial session 2 Network annotation

Filtering & editing data Use filters

QuickFind nodes and edges

Index the network based on a node or edge attribute

Dynamic filtering for numerical attributes

Build complex filters using AND, OR, NOT relations

Define topological filters (considers properties of near-by nodes)

Create subnetworks

04/21/2322

Page 23: Tutorial session 2 Network annotation

Filtering & editing data (Demo)

04/21/2323

Page 24: Tutorial session 2 Network annotation

Use case: Assignment 2.41. What is the gene with most mutations?

2. Filter the network for genes with more than one mutation (why?) and save the new network.

3. Use the Bisogenet plugin...

1. ...to find interactions among these

2. ...to find interactions among these and their first neighbours (or explore different settings of Bisogenet according to your taste)

4. Store your session for later usehttp://cytoscape.org/manual/Cytoscape2_8Manual.html#Finding and Filtering

Nodes and Edgeshttp://bio.cigb.edu.cu/biso/BisoGenet_User_Manual.pdf

04/21/2324

Page 25: Tutorial session 2 Network annotation

To be continued…

Build, visualize and analyze your own network with Cytoscape Network generation Network annotation and visualization Network analysis

Identify active subnetworks

Analyze Gene Ontology enrichment

Perform topological analysis

Find network clusters

Find network motifs

04/21/2325