mir/ovs - ovs - Mike's Git repositories

mir/ovs

mirror of https://github.com/openvswitch/ovs synced 2025-08-28 04:47:49 +00:00

Author	SHA1	Message	Date
Ben Pfaff	c1c9c9c4b6	Implement QoS framework. ovs-vswitchd doesn't declare its QoS capabilities in the database yet, so the controller has to know what they are. We can add that later. The linux-htb QoS class has been tested to the extent that I can see that it sets up the queues I expect when I run "tc qdisc show" and "tc class show". I haven't tested that the effects on flows are what we expect them to be. I am sure that there will be problems in that area that we will have to fix.	2010-06-17 15:04:12 -07:00
Ben Pfaff	ff4ed3c9a1	netdev-linux: Create rtnetlink socket up front instead of on demand. This simplifies a bit of existing code since it is known that an rtnetlink socket will always be available. It will simplify additional code in upcoming commits.	2010-06-17 10:30:19 -07:00
Ben Pfaff	6912370445	netlink: Drop sock parameter from nl_msg_put_(ge)nlmsghdr(). These two functions use their "sock" parameter only to figure out the nlmsg_pid to put in the nlmsghdr. But that field can be filled in just as well right before sending the message. Since our functions for sending Netlink messages always modify the nlmsghdr anyhow (to fill in the length), there is little benefit to filling in the nlmsg_pid in advance. The cost, on the other hand, is having to pass another argument to functions that already have too many. So this commit removes the argument.	2010-06-17 10:30:18 -07:00
Jesse Gross	f4b6076aca	netdev-vport: Use vport set_stats instead of internal dev. In certain cases we require the ability to provide stats that are added to the values collected by the kernel (currently only used by bond fake devices). Internal devices previously implemented this directly but now that their stats are now handled by the vport layer the functionality has been moved there. This removes the userspace code to set the stats and replaces it with a mechanism to access the equivalent functionality in the vport layer.	2010-06-10 14:30:51 -07:00
Jesse Gross	7fbef77a30	netdev-linux: Add capability to get stats from vport layer. The vport layer has the ability to track stats using 64-bit counters, even if the kernel is only 32-bit. This first attempts to collect stats from these counters if they are available and otherwise falls back to the normal Linux interfaces.	2010-06-10 14:30:51 -07:00
Jesse Gross	61b999dd6f	netdev-linux: Give tap FD to first opener. Tap devices can have two FDs that allow transmit and receive from different perspectives. We previously would always share one of the FDs among all openers. However, this is confusing to some users (primarily the DHCP client) which expect tap devices to behave like any other device. Now we give the tap FD to the first opener, which knows that it has opened a tap device, and a normal system FD to everyone else for consistency.	2010-06-01 17:27:45 -07:00
Jesse Gross	92df599cb2	netdev-linux: Fix tap device stats. For tap and internal devices we swap the transmit and receive stats to appear consistent with other devices. However, the check whether to store the stats in a temporary location before the swap did not include tap devices, which lead to the use of uninitialized memory when the swap occured.	2010-06-01 17:27:45 -07:00
Jesse Gross	4d10512c91	netdev-linux: Quiet down ingress policing. If we attempt to remove ingress policing and receive "invalid argument" it means that policing isn't compiled into the kernel. If it isn't compiled in then accept that policing has been successfully removed.	2010-05-19 14:12:27 -07:00
Jesse Gross	2158888d8d	patch: Remove veth driver. Now that we have a new patch implementation, remove the veth driver and its userspace components. Then rename 'patchnew' to 'patch'. The new implementation is a drop-in replacement for the old one.	2010-05-18 12:57:25 -07:00
Ben Pfaff	6f42c8ea9a	netdev-linux: Optimize removing policing from an interface. It is very expensive to start a subprocess and, especially, to wait for it to complete. This replaces the most common subprocess operation in netdev_linux_set_policing() by a Netlink socket operation, which is much faster. Without this and the other netdev-linux commits, my 1000-interface test case runs in 1 min 48 s. With them, it runs in 25 seconds.	2010-05-05 14:00:50 -07:00
Ben Pfaff	80a86fbed4	netdev-linux: Cache policing values. Without this and the following netdev-linux commits, my 1000-interface test case runs in 1 min 48 s. With them, it runs in 25 seconds.	2010-05-05 14:00:50 -07:00
Ben Pfaff	8e46022197	netdev-linux: Factor out removing policing. This is duplicated code that the following commit will rewrite.	2010-05-05 14:00:50 -07:00
Ben Pfaff	a5af30fbaa	netdev-linux: Factor out obtaining an RTNL socket. Another function needs this same functionality in an upcoming commit, so factor this into a new function get_rtnl_sock().	2010-05-05 14:00:50 -07:00
Ben Pfaff	8722022c0c	Update fake bond devices' statistics with the sum of bond slaves' stats. Needed by XAPI to accurately report bond statistics. Ugh. Bug NIC-63.	2010-04-19 11:12:27 -07:00
Jesse Gross	6f643e4946	tunneling: Remove old GRE implementation. The new GRE implementation provides a complete drop in replacement for the old Linux based implementation. Therefore, remove the old implementation and rename "grenew" to "gre".	2010-04-19 09:11:58 -04:00
Jesse Gross	658797c83a	netdev-linux: Don't free a member of a struct. We allocate struct netdev_linux which contains struct netdev but free the netdev. In practice this makes no difference because the netdev is the first member of the struct but we should be correct anyways.	2010-04-19 09:11:57 -04:00
Jesse Gross	15b3596a41	netdev-linux: Check notifications are for netdev-linux device. When receiving a change notification from rtnetlink we checked whether a netdev of that name existed and if so tried to handle it. This also checks that the type of the device is one handled by netdev-linux.	2010-04-19 09:11:57 -04:00
Justin Pettit	8aed4223e0	netdev: Add support for "patch" type This commit introduces a new netdev type called "patch". A patch is a pair of interfaces, in which frames sent through one of the devices pop out of the other. This is useful for linking together datapaths. A patch's only argument on creation is "peer", which specifies the other side of the patch. A patch must be created in pairs, so a second netdev must be created with the "name" and "peer" values reversed. The current implementation is built using veth devices. Further, it's limited to the veth devices which support configuration through sysfs. This limits the ability to use a "patch" on 2.6.18 kernels using the veth device we include (read: flavors of XenServer 5.5). In the not too distant future, the implementation will be modified to use the new kernel port abstraction introduced by Jesse Gross's forthcoming GRE work. At that point, patch devices will work on any Linux platform supported by OVS.	2010-04-15 03:50:28 -07:00
Jesse Gross	468991ad6c	gre: Add support for path MTU discovery. This allows path MTU discovery to properly work when used with bridging. While there was previously support for PMTUD it used the kernel's IP stack. This works fine for routing but when bridging it is possible that a complete network is operating over the bridge that the kernel has no knowledge of and the ICMP fragmentation needed packets are lost. When a packet arrives that is above the MTU of the tunnel, an ICMP message is synthesized and send back on the device that the original packet came from. This does not rely on the kernel IP stack and is therefore independent of the routing table. Both IPv4 and IPv6 are supported, including over VLANs. Other types of packets that are over the MTU are encapsulated and the outer packets are fragmented. This entire functionality is a layer violation since bridging operates at layer 2 and fragmentation is a function of layer 3. For this reason it is possible to disable PMTUD, which will provide complete transparency but will cause the outer IP packets to be fragmented.	2010-03-05 16:32:05 -05:00
Jesse Gross	8ab4016b36	gre: Allow ToS on outer packet to be configured. When creating a GRE tunnel, it is now possible to either set the ToS of the outer packet to a fixed value or copy it from the inner packet.	2010-03-05 16:31:27 -05:00
Jesse Gross	a9a4b30c00	gre: Always set TTL on outer packet to 64. Currently the TTL is copied from the inner packet of the tunnel to the outer packet if the inner packet is IP. This is good if your GRE packets might make it into the input of your device but bad if you want to be fully transparent. This also resolves an inconsistency between tunnels set up using the ioctl and using Netlink. The ioctl version would force PMTUD on if a fixed TTL is set as a backup way to prevent loops but it never made it over to the newer Netlink code so obviously no one cares too much about it. This removes it to provide consistency and transparency. Basically, don't create loops and you will be happy.	2010-03-05 16:31:27 -05:00
Ben Pfaff	c69ee87c10	Merge "master" into "next". The main change here is the need to update all of the uses of UNUSED in the next branch to OVS_UNUSED as it is now spelled on "master".	2010-02-11 11:11:23 -08:00
Ben Pfaff	67a4917b07	Rename UNUSED macro to OVS_UNUSED to avoid naming conflict. Requested by Jean Tourrilhes <jt@hpl.hp.com>.	2010-02-11 10:59:47 -08:00
Ben Pfaff	b62aeed2ab	netdev-linux: Avoid fiddling with indeterminate data. If we are using netlink to get stats and get_ifindex() fails, then for an internal network device we will then swap around a bunch of indeterminate (uninitialized) data values. That won't hurt anything--the caller will still set them to all-1-bits due to the error--but it still seems wrong. So this commit avoid it. Found using Clang (http://clang-analyzer.llvm.org/).	2010-02-11 10:34:45 -08:00
Jesse Gross	46415c9085	netdev-linux: Use the netdev list of devices instead of cachemap. We previously maintained a list of open devices inside of the linux netdev. Since the netdev library now maintains this list, it is better to use that list instead of our own.	2010-01-18 18:26:44 -05:00
Jesse Gross	49a6a1636f	netdev-linux: Avoid potential issues with unset FD. Never close the file descriptor if it is 0, since it is never a valid FD in this context. Also initialize the FD to -1 so that it is never set to a valid but incorrect value.	2010-01-18 18:23:14 -05:00
Jesse Gross	139faa3116	netdev-linux: Properly store netdev_dev pointer for RTNL callbacks. We were storing a struct netdev_dev_linux ** instead of a netdev_dev_linux * in the cache map. This prevented the cache from being invalidated on changes such as link status.	2010-01-16 09:50:35 -05:00
Justin Pettit	d5cdde1f96	netdev: Increase default ingress policing burst size The default burst rate was 10Kb. This increases it to 1000kb, since we were having problems getting traffic through at 10kb. A better value probably exists between these two points, but that will require additional experimentation.	2010-01-15 19:02:13 -08:00
Ben Pfaff	88258e0034	netdev-linux: Don't close(0) when closing an ordinary netdev. Calling close(0) at random points is bad. It means that the next call to socket() or open() returns fd 0. Then the next time a netdev gets closed, that socket or file fd gets closed too, and you end up with weird "Bad file descriptor" errors. Found by installing the following as lib/unistd.h in the source tree: #ifndef UNISTD_H #define UNISTD_H 1 #include <stdlib.h> #include_next <unistd.h> #undef close #define close(fd) rpl_close(fd) static inline int rpl_close(int fd) { if (!fd) { abort(); } return (close)(fd); } #endif	2010-01-15 15:35:38 -08:00
Jesse Gross	5b7448ed80	netdev-linux: Cleanup tap netdev. TAP devices need to be treated slightly differently from other other devices because they cannot be opened multiple times. Instead we open them once and share the file descriptor. This means that if the netdev is opened multiple times one reader can drain the buffers of another. While this is a deviation from the normal convention, it does not impact current or planned users. In addition, this cleans up some confusion between the file descriptor for tap devices versus other FD's.	2010-01-15 11:34:34 -05:00
Jesse Gross	0b0544d706	gre: Add support for destroying GRE devices. This allows GRE tunnel devices to be torn down on graceful exit of vswitch and cleaned up on restart for non-graceful exits.	2010-01-15 11:34:34 -05:00
Jesse Gross	149f577a25	netdev: Fully handle netdev lifecycle through refcounting. This builds on earlier work that implemented netdev object refcounting. However, rather than requiring explicit create and destroy calls, these operations are now performed automatically based on the referenece count. This is important because in certain situations it is not possible to know whether a netdev has already been created. A workaround existed (which looked fairly similar to this paradigm) but introduced it's own issues. This simplifies and unifies the API.	2010-01-15 11:34:34 -05:00
Ben Pfaff	c100e025e2	netdev-linux: Fix aliasing error. The latest version of GCC flags a common socket convention as breaking strict-aliasing rules. This commit removes the aliasing and gets rid of the scary warning.	2009-12-14 22:59:55 -08:00
Jesse Gross	a740f0de5b	gre: Add userspace GRE support. This implements the userspace portion of GRE on Linux. It communicates with the kernel module to setup tunnels using either Netlink or ioctls as appropriate based on the kernel version. Significant portions of this commit were actually written by Justin Pettit.	2009-12-07 12:48:08 -08:00
Ben Pfaff	58fda1dab1	Merge "master" branch into "db".	2009-12-02 11:49:53 -08:00
Justin Pettit	6c88d577e8	netdev: Allow explicit creation of netdev objects This change adds netdev_create() and netdev_destroy() functions to allow the creation of network devices through the netdev library. Previously, network devices had to already exist or be created on demand through netdev_open(). This caused problems such as not being able to specify TAP devices as ports in ovs-vswitchd, which this patch fixes. This also lays the groundwork for adding GRE and VDE support.	2009-12-01 19:01:01 -08:00
Ben Pfaff	9ab3d9a3c2	netdev: New function netdev_get_ifindex(). sFlow needs the ifindex of an interface, so this commit adds a function to retrieve it.	2009-11-23 12:25:08 -08:00
Ben Pfaff	7671589afb	netdev: Really set output values to 0 on failure in netdev_get_features(). The comment on netdev_get_features() claimed that all of the passed-in values were set to 0 on failure, but the implementation didn't live up to the promise. CC: Paul Ingram <paul@nicira.com>	2009-11-19 13:32:59 -08:00
Ben Pfaff	ec6fde61c8	Add new function xzalloc(n) as a shorthand for xcalloc(1, n).	2009-11-04 14:52:32 -08:00
Ben Pfaff	eb395f2ebf	netdev-linux: Improve netdev_linux_set_etheraddr(). Fixes a bug whereby netdev_linux_set_etheraddr() would update the cached Ethernet address but not mark it valid. (This potentially wasted a system call later but wasn't harmful.) As an added optimization, don't set the Ethernet address at all if the new address is the same as the current address.	2009-10-02 11:04:06 -07:00
Jesse Gross	c0e5f6cabe	netdev-linux: Return correct error codes on receive. netdev_linux_receive was returning positive error codes while the interface specifies that it should be returning negative errors. This difference causes a huge increase in (non-existant) packet processing with the userspace datapath.	2009-10-02 10:36:41 -07:00
Jesse Gross	1a487cec00	netdev-linux: Fix tap device using wrong FD. Tap devices were doing ioctls on the AF_INET socket, instead of the FD opened on the tap device.	2009-09-30 12:43:05 -07:00
Ben Pfaff	576e26d7b4	Merge citrix branch into master.	2009-09-22 10:17:44 -07:00
Jesse Gross	edaa959f6b	netdev-linux: Set missing cache validity bit. Whether a port is internal is cached to avoid requerying the kernel every time stats are requested. However, the cache vality bit was never being set so the cache wasn't used. This corrects that oversight. Thanks to Ben Pfaff for noticing.	2009-09-16 11:03:42 -07:00
Jesse Gross	fe6b0e03f6	netdev: Swap transmit and receive stats on internal ports. Internal ports appear to have their transmit and receive stats swapped because from the kernel's point of view these ports are acting like the machine connected to the switch, not the switch itself. This swaps the stats for consistency with other ports.	2009-09-14 14:12:23 -07:00
Ben Pfaff	f1acd62b54	Merge citrix branch into master.	2009-09-02 10:14:53 -07:00
Ben Pfaff	559843ed53	rtnetlink: Move into separate source and header file. Now that rtnetlink isn't named similarly to netdev_linux, it might as well have its own source and header files to avoid confusing everyone.	2009-07-30 16:07:15 -07:00
Ben Pfaff	d81c0ac56d	rtnetlink: Document.	2009-07-30 16:07:14 -07:00
Ben Pfaff	46097491e4	netdev-linux: Rename "linux_netdev_" to "rtnetlink_". It was getting to be too confusing to have both netdev_linux_* functions and linux_netdev_* functions. Rename the latter to make the distinction more obvious. "rtnetlink" seems to be a fairly good name because that's what the kernel calls it, so the name will be familiar at least to people who know about rtnetlink.	2009-07-30 16:07:14 -07:00
Ben Pfaff	8b61709d5e	netdev: Implement an abstract interface to network devices. This new abstraction layer allows multiple implementations of network devices in a single running process. This will be useful, for example, to support network devices that are simulated entirely in the running process or that communicate with other processes over Unix domain sockets, etc. The reimplemented tap device support in this commit has not been tested.	2009-07-30 16:07:14 -07:00

... 4 5 6 7 8

351 Commits