mir/ovs - ovs - Mike's Git repositories

mir/ovs

mirror of https://github.com/openvswitch/ovs synced 2025-08-28 12:58:00 +00:00

Author	SHA1	Message	Date
Pravin Shelar	9b02078077	datapath: Strip down vport interface : OVS_VPORT_ATTR_MTU There is no need to have vport attribute MTU (OVS_VPORT_ATTR_MTU) as linux net-dev-ioctl can be used to get/set MTU for linux device. Following patch removes OVS_VPORT_ATTR_MTU from datapath protocol. This patch also adds netdev_set_mtu interface. So that MTU adjustments can be done from OVS userspace. get_mtu() interface is also changed, now get_mtu() returns EOPNOTSUPP rather than returning 0 and setting *pmtu to INT_MAX in case there is no MTU attribute for given device. Signed-off-by: Pravin B Shelar <pshelar@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-09-12 17:12:52 -07:00
Pravin Shelar	ff8d7a5e81	Strip down vport interface : iflink Remove iflink from vport interface. iflink is not used anywhere in OVS. So there is not need to have iflink as vport attribute. Signed-off-by: Pravin B Shelar <pshelar@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-09-08 15:18:42 -07:00
Ethan Jackson	c7178a0b3e	dpif-linux: Stop listening for RTNL notifications. Currently dpif-linux listens for vport change events using rtnetlink notifications. This patch switches to the ovs genl notification system. Feature #6809.	2011-09-01 17:22:00 -07:00
Ethan Jackson	0a811051ff	netlink-notifier: Rename rtnetlink code. This patch renames the rtnetlink module's code to "nln" for "netlink notifier". Callers are now required to pass in the netlink protocol to he newly renamed nln_create() function.	2011-09-01 17:18:52 -07:00
Ethan Jackson	45c8d3a189	lib: Rename rtnetlink.[ch] files. The only rtnetlink specific functionality contained in the rtnetlink module is the use of the NETLINK_ROUTE protocol. This can easily be passed in by callers. In preparation for generalization, this patch renames rtnetlink.[ch] to netlink-notifier.[ch]. Future patches will complete the transition.	2011-09-01 17:18:51 -07:00
Justin Pettit	24b019f808	datapath: Disable LRO from userspace instead of the kernel. Whenever a port is added to the datapath, LRO is automatically disabled. In the future, we may want to enable LRO in some circumstances, so have userspace disable LRO through the ethtool ioctls. As part of this change, the MTU and LRO checks are moved to netdev-vport's send(), which is where they're actually needed. Feature #6810 Signed-off-by: Justin Pettit <jpettit@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-08-28 21:30:58 -07:00
Ethan Jackson	d8abf60c72	dpif-linux: Call rtnetlink_notifier_run() as required. I don't think this actually fixes a bug, as netdev-linux calls this function. However, it seems stylistically more correct.	2011-08-22 13:06:04 -07:00
Justin Pettit	df2c07f433	datapath: Use "OVS_" as opposed to "ODP_" for user<->kernel interactions. The prefix "ODP_" is not overly descriptive in the context of the larger Linux tree. This commit changes the prefix to "OVS_" for the userpace to kernel interactions. The userspace libraries still use "ODP_" in many of their interfaces since it is more descriptive in the OVS oeuvre. Feature #6904 Signed-off-by: Justin Pettit <jpettit@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-08-19 22:48:23 -07:00
Ben Pfaff	d3c110b648	dpif-linux: Fix memory and file descriptor leak in dpif_linux_close(). Found with valgrind.	2011-06-07 13:20:35 -07:00
Ethan Jackson	eb8b28e7da	dpif-linux: Avoid duplicate code in dpif_linux_vport_send(). dpif_linux_vport_send() had duplicated most of the code in dpif_linux_execute() in order to execute output actions in the kernel. This forces developers to remember to change both functions whenever the kernel interface changes. In particular, commit 80e5eed9 "datapath: Get packet metadata from userspace in odp_packet_cmd_execute()." broke netdev_linux_vport_send(). This commit reorganizes the code and fixes the regression. Bug #5818.	2011-06-06 11:13:22 -07:00
Ben Pfaff	80e5eed9c2	datapath: Get packet metadata from userspace in odp_packet_cmd_execute(). Until now, the tun_id and in_port have been lost when a packet is sent from the kernel to userspace and then back to the kernel. I didn't think that this was a problem, but recent behavior made me look closer and see that it makes a difference if sFlow is turned on or if an ODP_ATTR_ACTION_CONTROLLER action is present. We could possibly kluge around those, but for future-proofing it seems better to pass the packet metadata from userspace to the kernel. That is what this commit does. This commit introduces a user-kernel protocol break. We could avoid that, if it is desirable, by making ODP_PACKET_ATTR_KEY optional for ODP_PACKET_CMD_EXECUTE commands. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-06-01 13:39:51 -07:00
Ben Pfaff	b2fda3effc	Merge 'next' into 'master'. I know already that this breaks the statsfixes that were implemented by the following commits: 827ab71c97f "ofproto: Datapath statistics accounted twice." 6f1435fc8f7 "ofproto: Resubmit statistics improperly account during..." These were already broken in a previous merge. I will work on a fix.	2011-05-18 14:01:13 -07:00
Ben Pfaff	d3d8f1f7e5	Add missing "static" keywords. Found by sparse.	2011-05-16 13:40:47 -07:00
Ben Pfaff	0079481775	Merge 'master' into 'next'.	2011-05-12 12:05:42 -07:00
Ben Pfaff	640e1b2077	dpif: Improve abstraction by making 'run' and 'wait' functions per-dpif. Until now, the dp_run() and dp_wait() functions had to be called at the top level of the program because they applied to every open dpif. By replacing them by functions that take a specific dpif as an argument, we can call them only from ofproto, which is currently the correct layer to deal with dpifs.	2011-05-11 12:26:07 -07:00
Ben Pfaff	032aa6a354	ovs-dpctl: Add -s option to print packet and byte counters.	2011-05-02 09:33:12 -07:00
Ethan Jackson	8522b38386	dpif-linux: Recycle leaked ports. When ports are deleted from the datapath they need to be added to an LRU list maintained in dpif-linux so they may be reallocated. When using vswitchd to delete the ports this happens automatically. However, if a port is deleted directly from the datapath it is never reclaimed by dpif-linux. If this happens often, eventually no ports will be available for allocation and dpif-linux will fall back to using the old, kernel implemented, allocation strategy. This commit fixes the problem by automatically reclaiming ports missing from the datapath whenever the list of ports in the datapath is dumped. Bug #2140.	2011-04-29 16:06:31 -07:00
Ben Pfaff	141d9ce465	dpif-linux: Avoid logging error on ENOENT in dpif_linux_is_internal_device(). ENOENT can be returned if the kernel module isn't loaded. If that's the case then we've already logged that and there's no point in logging it again.	2011-04-11 10:46:39 -07:00
Ben Pfaff	42bb6c72b5	dpif-linux: Avoid segfault on netdev_get_stats() without kernel module. netdev_linux_get_stats() calls into netdev_vport_get_stats(), which in turn attempts a transaction on genl_sock. If the kernel module isn't loaded, then genl_sock won't be there, and in any case there's nothing that guarantees that it's been initialized yet. This fixes the problem by ensuring that dpif_linux was initialized properly before attempting a transaction on genl_sock. Reported-by: Aaron Rosen <arosen@clemson.edu>	2011-04-11 10:46:32 -07:00
Ethan Jackson	773cd53821	dpif-linux: Choose port numbers more prudently. Before this patch the kernel chose the lowest available number for newly created datapath ports. This patch moves the port number choosing responsibility to user space, and implements a least recently used port number queue in an attempt to avoid reuse. Bug #2140.	2011-04-05 20:40:27 -07:00
Ben Pfaff	7feba1acdd	netdev-vport: Implement 'send' function. The new implementation of the bonding code expects to be able to send packets on netdevs using netdev_send(). This implements it.	2011-04-01 15:52:19 -07:00
Ben Pfaff	d0c23a1a57	dpif: Use sset instead of svec in dpif interface.	2011-03-31 16:42:01 -07:00
Ben Pfaff	b3c01ed330	Convert shash users that don't use the 'data' value to sset instead. In each of the cases converted here, an shash was used simply to maintain a set of strings, with the shash_nodes' 'data' values set to NULL. This commit converts them to use sset instead.	2011-03-31 16:42:01 -07:00
Ben Pfaff	f915f1a8ca	datapath: Consider tunnels to have no MTU, fixing jumbo frame support. Until now, tunnel vports have had a specific MTU, in the same way that ordinary network devices have an MTU, but treating them this way does not always make sense. For example, consider a datapath that has three ports: the local port, a GRE tunnel to another host, and a physical port. If the physical port is configured with a jumbo MTU, it should be possible to send jumbo packets across the tunnel: the tunnel can do fragmentation or the physical port traversed by the tunnel might have a jumbo MTU. However, until now, tunnels always had a 1500-byte MTU by default. It could be adjusted using ODP_VPORT_MTU_SET, but nothing actually did this. One alternative would be to make ovs-vswitchd able to set the vport's MTU. This commit, however, takes a different approach, of dropping the concept of MTU entirely for tunnel vports. This also solves the problem described above, without making any additional work for anyone. I tested that, without this change, I could not send 1600-byte "pings" between two machines whose NICs had 2000-byte MTUs that were connected to vswitches that were in turn connected over GRE tunnels with the default 1500-byte MTU. With this change, it worked OK, regardless of the MTU of the network traversed by the GRE tunnel. This patch also makes "patch" ports MTU-less. It might make sense to remove vport_set_mtu() and the associated callback now, since ordinary network devices are the only vports that support it now. Signed-off-by: Ben Pfaff <blp@nicira.com> Suggested-by: Jesse Gross <jesse@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com> Bug #3728.	2011-02-04 09:46:26 -08:00
Ben Pfaff	3005302426	datapath: Dump flow actions only if there is room. Expanding an skbuff in a netlink dump handler doesn't work well. We weren't updating the truesize of the skb or the allocation within the socket that netlink_dump() had put the skb in. The code had other bugs too. This commit fixes the problem (in my tests, anyway) by avoiding expanding the reply skbuff to fill in the actions. Instead, in such a case the userspace client has to do a separate "get" action to get the actions. This commit also updates userspace to do this automatically for dumps in the cases where the caller cares (only "ovs-dpctl dump-flows" currently cares). Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com> Bug #4520.	2011-02-01 09:25:26 -08:00
Ben Pfaff	d2a23af251	dpif-linux: Always pass an actions attribute in dpif_flow_put(). The kernel expects that ODP_FLOW_NEW always has an ODP_FLOW_ATTR_ACTIONS attribute, even though that attribute may be empty to drop all of the packets in the flow. Similarly, ODP_FLOW_SET as used by dpif_linux_flow_put() should always have such an attribute, since it is used by OVS to update the flow's actions. So make it possible for dpif_linux_flow_to_ofpbuf() to pass an empty actions attribute, and make dpif_linux_flow_put() always force that behavior if the actions_len passed to it is 0. This fixes EINVAL error creating flows to drop packets. Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-31 21:40:07 -08:00
Jesse Gross	9e980142f4	dpif-linux: Read flow used time. We were never storing the flow used time from the Netlink message into our local struct, which caused flows to timeout prematurely. Acked-by: Ben Pfaff <blp@nicira.com>	2011-01-31 14:57:04 -08:00
Jesse Gross	f9ef1c31cf	dpif-linux: Add missing NLM_F_ECHO flag to flow requests. Flow transactions expect a response after the operation has completed but the request did not have NLM_F_ECHO set. This caused userspace to receive only the Netlink ACK instead of a real response, making it appear that the operation had failed when it actually succeeded.	2011-01-29 18:09:37 -08:00
Ethan Jackson	4ca85130db	dpif-linux: Remove extraneous name variable. Fixes a "used uninitialized" warning.	2011-01-29 13:57:08 -08:00
Ben Pfaff	3d8c95357f	dpif: Remove dpif_get_all_names(). None of the remaining dpif implementations have more than one name per dpif, so there's no need for this function anymore. Suggested-by: Jesse Gross <jesse@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-28 15:34:40 -08:00
Ben Pfaff	254f2dc8e3	datapath: Change dp_idx to dp_ifindex, the ifindex of the local port. I can't see any real value in maintaining a dp_idx separate from the ifindex of the local port. With the current implementation it also artificially limits the number of datapaths. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-28 15:34:40 -08:00
Ben Pfaff	957709ea81	dpif-linux: Replace 'minor' by 'dp_idx'. The dp_idx used to be the character device minor number, but there's no character device anymore, so rename for clarity. Reviewed by Justin Pettit.	2011-01-28 15:34:40 -08:00
Ben Pfaff	37a1300c3c	datapath: Convert ODP_FLOW_* commands to use AF_NETLINK socket layer. This completes the transition to the Generic Netlink interface, and so this commit restores support for Linux 2.6.18 and later. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-28 15:34:30 -08:00
Ben Pfaff	f0fef76062	datapath: Convert ODP_VPORT_* to use AF_NETLINK socket layer. This commit calls genl_lock() and thus doesn't support Linux before 2.6.35, which wasn't exported before that version. That problem will be fixed once the whole userspace interface transitions to Generic Netlink a few commits from now. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-28 15:34:27 -08:00
Ben Pfaff	aaff4b55a7	datapath: Convert ODP_DP_* commands to use AF_NETLINK socket layer. This commit calls genl_lock() and thus doesn't support Linux before 2.6.35, which wasn't exported before that version. That problem will be fixed once the whole userspace interface transitions to Generic Netlink a few commits from now. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-28 13:55:04 -08:00
Ben Pfaff	982b88105d	datapath: Convert upcalls and ODP_EXECUTE to use AF_NETLINK socket layer. This commit calls genl_lock() and thus doesn't support Linux before 2.6.35, which wasn't exported before that version. That problem will be fixed once the whole userspace interface transitions to Generic Netlink a few commits from now. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-28 12:17:03 -08:00
Ben Pfaff	82272eded1	Eliminate ODPL_* from userspace-facing interface. Reviewed by Justin Pettit.	2011-01-27 21:08:41 -08:00
Ben Pfaff	f7cd0081f5	datapath: Convert ODP_EXECUTE to use Netlink framing. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-27 21:08:40 -08:00
Ben Pfaff	d656937779	datapath: Convert datapath operations to use Netlink framing. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-27 21:08:40 -08:00
Ben Pfaff	9c52546b52	datapath: Convert ODP_FLOW_* and ODP_EXECUTE to put dp_idx into message. When the datapath moves to the Netlink protocol it won't have a minor number to use, so we have to put the dp_idx in the message. This also changes the kernel implementation of ODP_FLOW_FLUSH to do the datapath locking inside flush_flows() instead of inside openvswitch_ioctl() but doesn't change that command's userspace interface, which still passes a datapath number as the ioctl argument. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-27 21:08:40 -08:00
Ben Pfaff	693c4a0112	datapath: Eliminate 'flags' member from odp_flow. Nothing was productively using the 'flags' member of odp_flow, so this commit removes it. ODPFF_ZERO_TCP_FLAGS isn't used at all (as of the previous commit). ODPFF_EOF has been replaced by a special case of the 'key_len' member. This will go away, too, once AF_NETLINK starts being used. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-27 21:08:39 -08:00
Ben Pfaff	ba25b8f41f	dpif: Eliminate ODPPF_* constants from client-visible interface. Following this commit, the ODPPF_* constants are only used in Linux-specific parts of OVS userspace code. This allows the actual Linux datapath interface to evolve more freely. Reviewed by Justin Pettit.	2011-01-27 21:08:39 -08:00
Ben Pfaff	c97fb13280	dpif: Eliminate "struct odp_flow_stats" from client-visible interface. Following this commit, "struct odp_flow_stats" is only used in Linux-specific parts of OVS userspace code. This allows the actual Linux datapath interface to evolve more freely. Reviewed by Justin Pettit.	2011-01-27 21:08:38 -08:00
Ben Pfaff	feebdea2e5	dpif: Eliminate "struct odp_flow" from client-visible interface. Following this commit, "struct odp_flow" and related data structures are only used in Linux-specific parts of OVS userspace code. This allows the actual Linux datapath interface to evolve more freely. Reviewed by Justin Pettit.	2011-01-27 21:08:38 -08:00
Ben Pfaff	bc4a05c639	datapath: Change ODP_FLOW_GET to retrieve only a single flow at a time. This brings the code closer to what the Netlink interface will need to implement. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-27 21:08:38 -08:00
Ben Pfaff	996c1b3d7a	datapath: Drop port information from odp_stats. As with n_flows, n_ports was used regularly by userspace to determine how much memory to allocate when listing ports, but it is no longer needed for that. max_ports, on the other hand, is necessary but it is also a fixed value for the kernel datapath right now and if we expand it we can also come up with a way to report the expanded value. The remaining members of odp_stats are actually real statistics that I intend to keep. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-27 21:08:38 -08:00
Ben Pfaff	1ba530f4b2	datapath: Drop queue information from odp_stats. This queue information will be available through the kernel socket layer once we move over to Netlink socket as transports, so we might as well get rid of the redundancy. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-27 21:08:38 -08:00
Ben Pfaff	c19e653509	datapath: Change userspace vport interface to use Netlink attributes. One of the goals for Open vSwitch is to decouple kernel and userspace software, so that either one can be upgraded or rolled back independent of the other. To do this in full generality, it must be possible to add new features to the kernel vport layer without changing userspace software. The customary way to do this in the Linux networking stack is to use Netlink and in particular Netlink attributes. This commit adopts that model for the vport layer. It does not yet actually start using the Netlink socket layer, which will come later. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-27 21:08:37 -08:00
Ben Pfaff	c283069c71	datapath: Change vport type from string to integer enumeration. I plan to make the vport type part of the standard header stuck on each Netlink message related to a vport. As such, it is more convenient to use an integer than a string. In addition, by being fundamentally different from strings, using an integer may reduce the confusion we've had in the past over the differences in userspace and kernel names for network device and vport types. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Jesse Gross <jesse@nicira.com>	2011-01-27 21:08:37 -08:00
Ben Pfaff	4c738a8da5	dpif: Eliminate "struct odp_port" from client-visible interface. Following this commit, "struct odp_port" is only used in Linux-specific parts of OVS userspace code. This allows the actual Linux datapath interface to evolve more freely. Reviewed by Justin Pettit.	2011-01-27 21:08:37 -08:00

1 2

97 Commits