TCP/IP Don McGregor Research Associate MOVES Institute [email protected].

download TCP/IP Don McGregor Research Associate MOVES Institute mcgredo@nps.edu.

If you can't read please download the document

Transcript of TCP/IP Don McGregor Research Associate MOVES Institute [email protected].

  • Slide 1
  • TCP/IP Don McGregor Research Associate MOVES Institute [email protected]
  • Slide 2
  • Background Youve got some computers, each running a simulation. How do you get them to talk to each other? Network
  • Slide 3
  • Network Communications Almost everything these days is done over TCP/IP, a framework for communications between devices Transmission Control Protocol/Internet Protocol Note that TCP/IP is a software concept, independent of hardware TCP/IP runs on all sorts of media Ethernet wire (10, 100, 1 GB, 10GB) Wireless Fiber Optic It can do this because it is layered--the higher levels hide the type of wire used
  • Slide 4
  • TCP/IP Protocol Stack Link Internet Transport: UDPTransport: TCP Application Hardware: ethernet, radio, fiber optic Internet Routing Sockets DoD M&S Network Protocols & lots more Protocol
  • Slide 5
  • Layering TCP/IP Sockets Ad-Hoc Protocols DISHLATENA Application/Simulation Application layer
  • Slide 6
  • Layering Notice that the hardware (link layer) is isolated from the application at the top. This means you can swap out the link layer every few years as faster media becomes available, and not affect your application at the top This is a Big Deal. The vast majority of money and programming time are tied up in the application layer, and you cant throw that away every few years Things like web browsers or games are written in such a way that they are completely isolated from hardware transport This allows us to created standards at the application layer that are stable over a period of decades
  • Slide 7
  • TCP/IP TCP/IP gives you the ability to send some bytes from one machine to another. As far as TCP/IP is concerned, its all just bytes. The application layer makes sense of those bytes by interpreting them. The application layer is where we standardize M&S network protocols. Note that the application layer includes both the DoD protocol (DIS, HLA, etc) along with what we think of as the application (OneSAF, etc)
  • Slide 8
  • Application Layer This is where DoD M&S network standards live. There have been several over the years: Ad hoc: Created as needed for proprietary systems; elder days to present Distributed Interactive Simulation (DIS): 1993-> present High Level Architecture (HLA): circa 1998-present TENA: Circa 2000-present Simulation protocols are usually embedded in applications like OneSAF, Janus, etc. Application layer in network-speak usually refers to the protocols, not the whole application, but this isnt hard-and-fast
  • Slide 9
  • Ad Hoc In the early years people just went out and did stuff, because it was all new This meant that the Boeing simulator didnt talk to the LockMart simulator, and furthermore the LockMart aircraft simulator might not talk to the other LockMart aircraft simulator The protocols were typically built on top of sockets (UDP and TCP) But it worked, sort of This is still done in the commercial gaming industry (Why? What relevance does this question have to DoD?)
  • Slide 10
  • Distributed Interactive Simulation The lack of standards led to DIS. Everyone in procurement realized that lack of interoperability was bad, so they got together and agreed upon a standard for M&S Agreed upon by the Simulation Interoperability Standards Organization (SISO) ratified as an IEEE standard This means anyone can read the standard, implement it, and have their simulator be interoperable with another DIS simulator
  • Slide 11
  • High Level Architecture HLA is a follow-on to DIS, intended to address a wider range of simulations and abstract away some of the network details. DIS was first person shooter oriented, while HLA can be used in a wider range of simulations, such as timestep HLA tries to hide the socket layer. Much more on this later
  • Slide 12
  • TENA Test & Training Enabling Architecture-- widely used in DoD ranges Very similar to CORBA (a distributed object technology, widely standardized since the 90s) There is considerable overlap between DIS, HLA, and TENA
  • Slide 13
  • Emerging Web The field is just starting deploy new web- based approaches. Typically this includes web sockets, webRTC, and mobile technologies A lot of promise, compelling economics Deploy on cloud, use on mobile devices Central point for upgrades Vastly simplified configuration management Infinite CPU on cloud side
  • Slide 14
  • Communications TCP/IP is the basic framework for communicating between devices Devices is a broad term. Can include desktops, cell phones, toasters, coke machines, etc.
  • Slide 15
  • TCP/IP Link Internet Transport: UDPTransport: TCP Application Hardware: ethernet, radio, fiber optic Internet Routing Sockets DoD M&S Network Protocols & lots more
  • Slide 16
  • Layers Link layer: this is the hardware layer (eg, ethernet, 802.11b) The switch you get at Best Buy is an example of a link layer device, Cat5 ethernet cable, fiber optic cable, wireless, etc
  • Slide 17
  • Link Layer You can easily spend an entire semester studying only the link layer We will assume it magically works
  • Slide 18
  • IP Layer The next layer up is responsible for routing packets to a destination When you send War & Peace, TCP/IP breaks up the text into packets, routes the packets to the destination, and then reassembles them back to the original text The IP layer is responsible for getting the individual packets to their destination process Routers handle IP. Examples include Cisco, Foundry, Vyatta. These are (often) expensive and require major geek support to run IP is mostly opaque to application programmers We will assume it magically works
  • Slide 19
  • Transport The sockets layer. This is where the developers of application layers mostly live Sockets are a way to send bytes from one device to another (or from the same device to itself). Sockets dont know anything about the content of the messages being sent--to them its all just a bunch of bytes. Its a way to get N bytes of data from one host to another
  • Slide 20
  • Sockets Notice that at the transport layer there are two types of sockets: TCP and UDP. These are intended to handle two different application domains You can use either or both in a single application Ie, OneSAF can use both a network protocol based on TCP and one based on UDP at the same time
  • Slide 21
  • TCP Sockets TCP sockets have some important properties: Reliable delivery of data In-order delivery of data No duplicates of data Built-in rate control What it attempts to replicate is reading and writing to a file
  • Slide 22
  • TCP Sockets Reliable: if you send data, there wont be gaps in the data sent, and messages are guaranteed to arrive, or youll get an error Recall that TCP/IP breaks up big chunks of data into many packets to send across the network. reliability means that if the network somehow drops one of those packets, it will be resent
  • Slide 23
  • TCP Sockets In order: When TCP/IP breaks up all the packets for sending, it will ensure that the packets are re-assembled in the same order they were sent on the receiving side No duplicates: in some obscure situations, the underlying network may duplicate packets. TCP ensures that the duplicate packets are discarded
  • Slide 24
  • TCP Sockets Rate limiting: What if you have a really fast server computer sending to an iPhone? The server has a really fast CPU and is hooked up to a fast network; it doesnt necessarily know it is sending to a slow CPU across a slow network Without this feature you can easily overwhelm the receiving machine and network--its a sort of denial of service attack TCP automatically throttles back the sending rate if too many packets are being dropped
  • Slide 25
  • TCP TCP Socket Data TCP sockets replicate writing to a file; data is sent (and received) across a full-duplex connection Writing to a file: open the file, write data, close the file. The Data appears in the file: reliable, in order, no dupes, rate limited Same thing with TCP sockets Host A Host B
  • Slide 26
  • UDP Sockets UDP is an alternative to TCP sockets that eliminate some of the features of TCP Unreliable data delivery--there is no guarantee that the receiving application will get everything you send Data may arrive out of order Duplicate data may arrive There is no built-in rate limiting Packet-oriented rather than stream-oriented
  • Slide 27
  • UDP Sockets Some of these features sound counter- intuitive. Why on earth would you use an API that may throw away data? The issue is that TCP introduces some overhead in latency and to a lesser extent bandwidth Sometimes we have applications that are fine if most of the data is received Example: position updates in a game
  • Slide 28
  • UDP Sockets The player on my computer is controlling a tank, and I will send out updates of its position every 1/30th of a second What happens if one out of a hundred updates is dropped?
  • Slide 29
  • UDP Sockets UDP is packet-oriented rather than stream- oriented TCP is handled much like reading and writing from a file, which is just a long stream of bytes In UDP you create discrete messages and send them
  • Slide 30
  • Sockets Note that both TCP and UDP are responsible only for sending data. They do not attempt to make any sense of the data itself--that is the responsibility of the next layer up Files are not responsible for the format or meaning of the data written to them--thats the responsibility of the applications that read or write the file The sockets API only gets data to the destination; once there the application/protocol has to make sense of the data
  • Slide 31
  • IP Numbers Every host (computer) on a network is assigned a unique IP number, usually written like this: 172.20.80.42 This is called the dotted decimal format. In reality the IP is 4 bytes long, and each number can (sort of) be in the range of 0-255. This uniquely identifies the computer on the network; you cant talk to something directly unless you have a name to distinguish it, and the IP is the real name of the computer in TCP/IP I want to connect to the host 131.120.7.15 will connect you to a particular host on the internet
  • Slide 32
  • IP Numbers You can see what IP your computer has by going to Control Panel->Network->TCP/IP->Properties, or ipconfig /all, or on OS X Control Panel->Network, or on Linux /sbin/ifconfig How are IPs assigned? If we have unique numbers for hosts we have to have some way to assure that each host is configured with a unique IP Two basic ways: Manually Dynamic Host Configuration Protocol (DHCP)
  • Slide 33
  • IP Numbers Manually: go to each machine, type in the IP number Whats wrong with this? DHCP: when the host boots, it contacts a server and asks for an IP. The server assigns an IP to the host from a floating pool of IPs The host has a lease on the IP for a limited time. After the time is expired, the server takes back the IP unless it has been renewed Why this approach? Why not have the computer resign the IP when done?
  • Slide 34
  • IPs DHCP has a weakness: a host may be assigned a different number the next time it boots This is perfectly OK for desktop clients, but not for servers. Typically people want to contact servers, and if the IP is constantly changing, they dont know how to address it. Clients, on the other hand, spend their time contacting servers, not being contacted Manually assign IPs to servers Have laptops & desktop clients use DHCP
  • Slide 35
  • Domain Name Service Suppose you want to connect to a web server. It would be bad to force users to memorize 72.21.210.11 rather than amazon.com DNS maps a name to an IP number. This is done by a server sitting on the network. Your host contacts the DNS server and asks what is the IP for www.nps.edu? The DNS server responds with the IPwww.nps.edu The DNS server is set on your host by DHCP (usually).
  • Slide 36
  • DNS DNS Server give me the IP for www.nps.edu the IP is 205.155.4.12 NameIP www.nps.edu205.155.4.12 Beatnik.ern.nps.edu 172.20.18.4 Mail.nps.edu 205.155.4.2 The DNS server maintains a table matching names to IPs
  • Slide 37
  • DNS The campus admins can enter the IPs for all the server hosts on campus. But what if we want to use a name to refer to a server off campus, like amazon.com? The campus admins have no idea what assignments amazon is making, and whats more the DNS server cant realistically have a database with every host name on the internet To refer to a host name off campus, the local DNS server simply asks amazons DNS server to resolve the name on our behalf
  • Slide 38
  • Off-Site DNS give me the IP for Amazon.com the IP is 72.12.18.4 Campus DNS Server Amazon.com DNS Server
  • Slide 39
  • DNS The campus DNS server contacts amazon on your behalf, gets the IP number, returns it to you, and caches the result for some period of time so later lookups are faster Potential problems? > nslookup www.apple.comwww.apple.com Non-authoritative answer: Name:apple.com Address: 17.149.160.49 The non-authoritative means the local DNS server got it from cache
  • Slide 40
  • DNS So how does the local DNS server know how to contact the Amazon DNS server? When a domain like nps.edu or amazon.com is registered, one of the required pieces of information is at least two DNS servers You can see the information with whois: http://www.networksolutions.com, whois link http://www.networksolutions.com
  • Slide 41
  • Whois Registrant: contact info Admin Contact: the suits Technical contact: the t-shirts DNS Servers: the names of the servers where the name-to-IP information is entered You can set up your own DNS servers or use someone elses as a paid or free service This is the standardized way for the info to get into the system
  • Slide 42
  • DNS The internet maintains 13 root DNS servers. These server Ips are hard-coded into a local DNS servers config file In reality it is a lot more than 13 servers; these are really clusters that are geographically distributed These root servers use the data entered during the domain registration process to direct a DNS server that has the information needed
  • Slide 43
  • Root DNS Root EduComOrg NetXXXUk AmazonApple
  • Slide 44
  • DNS When your local DNS server starts up cold, it has the root DNS servers. The first time it gets a request for amazon.com, it goes there, gets the.com DNS server, and then asks the.com server for amazons server All these answers are cached, so after the TLDs are retrieved it doesnt need to talk to the root servers
  • Slide 45
  • DNS The net result: you can refer to a host by name rather than IP, and as long as you have a functioning DNS server it will all work
  • Slide 46
  • DNS & Info Operations Obvious question: what happens if the DNS server (either locally or TLD or root) is compromised? Citizen in country under attack decides to bring up his local newspaper site; instead he is redirected to the attackers propaganda site This is used as part of the great firewall of China. PRC uses DNS servers to for example redirect facebook.com to alternative servers
  • Slide 47
  • Stupid DNS Tricks DNS can be used to aid scalability Round-robin DNS: The DNS server returns a list of IPs rather than a single IP, and the client just picks one. All the servers have identical content Now you have N servers to handle the load instead of just one
  • Slide 48
  • Stupid DNS Tricks Geographically based DNS: There are lists of IP numbers and their approximate location. Based on the IP of the client making the request you can return an IP thats close to the client In europe if you go to google.com youll get the language and country-specific site that corresponds to the IP of your computer
  • Slide 49
  • Content Distribution Network The idea of a CDN is to put the content close to the client. This is important for broken things like NMCI This is done by playing DNS tricksas with geographic DNS, the request is sent to a server close to the client or better connected with a cached copy of the site content. If the content is not present at the CDN edge server, its pulled down from the authoritative server Vendors: Akamai, Google, many others
  • Slide 50
  • Port Numbers Suppose you have a server named nps.edu that runs mail and a web server. You want to contact it, so you refer to it by IP (perhaps after a DNS lookup) But which program on the host do you talk to? Its running both mail and web. We need some further way to specify which service on the host we want to talk to: mail, web, DNS, OneSAF, etc
  • Slide 51
  • Port Numbers Each host (IP number) has ports that range in number from 0-64K. There is a separate port range for TCP and UDP, so UDP port 25 is not the same as TCP port 25 Port is a software concept, not a physical connection By convention, certain programs listen on certain ports. For example, mail servers traditionally listen on TCP port 25, and web servers on TCP port 80 So: to contact a web server at www.nps.edu, you should refer to IP 205.155.7.12 and port 80. If a mail server is also running, you can contact it at 205.155.7.12 and port 25www.nps.edu
  • Slide 52
  • Port Numbers How do you decide what port number goes with what service? You need a priori knowledge of this The IANA/ICANN maintains databases of various arbitrary number assignments for the internet http://www.iana.org/assignments/port- numbers
  • Slide 53
  • Firewalls What if you dont want someone connecting to your laptop? Firewalls prevent an outsider from establishing a connection to your host except on ports that you specify Firewalls can run on a host, or at a network border Common cause of failure when doing development or classwork 53
  • Slide 54
  • Firewalls Typically they are configured to prevent all but explicitly approved port connections 54
  • Slide 55
  • Typical Site Configuration 55
  • Slide 56
  • Summary TCP/IP layering UDP and TCP sockets IP numbers DNS Ports & Firewalls