mir/ovs - ovs - Mike's Git repositories

mirror of https://github.com/openvswitch/ovs synced 2025-08-28 12:58:00 +00:00

Author	SHA1	Message	Date
Ciara Loftus	fc82e877ef	dpif-netdev: Increase the number of EMC entries Prior to this commit, the number of possible entries in the Exact Match Cache stood at 1024 per thread equating to 0.18Mb. A typical server system will have 2.5Mb cache per core meaning a larger EMC will comfortably fit in. This patch increases the number of entries to 8192 per thread (1.4Mb) which in turn yields improved throughput when processing multiple flows of traffic. Signed-off-by: Ciara Loftus <ciara.loftus@intel.com> Signed-off-by: Ethan Jackson <ethan@nicira.com> Acked-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-05-21 13:46:43 -07:00
Ethan Jackson	cd159f1a82	dpdk: Ditch MAX_PKT_BURST macro. The MAX_PKT_BURST and NETDEV_MAX_RX_BATCH macros had a confusing relationship. They basically purport to do the same thing, making it unclear which is the source of truth. Furthermore, while NETDEV_MAX_RX_BATCH was 256, MAX_PKT_BURST was 32, meaning we never process a batch larger than 32 packets further adding to the confusion. This patch resolves the issue by removing MAX_PKT_BURST completely, and shrinking the new NETDEV_MAX_BURST macro to only 32. This should have no change in the execution path except shrinking a couple of structs and memory allocations (can't hurt). Signed-off-by: Ethan Jackson <ethan@nicira.com> Acked-by: Daniele Di Proietto <diproiettod@vmware.com>	2015-05-19 14:47:00 -07:00
Daniele Di Proietto	8aaa125dab	dpif-netdev: Share emc and fast path output batches. Until now the exact match cache processing was able to handle only four megaflows. The rest of the packets were passed to the megaflow classifier. The limit was arbitrarily set to four also because the algorithm used to group packets in output batches didn't perform well with a lot of megaflows. After changing the algorithm and after some performance testing it seems much better just to share the same output batches between the exact match cache and the megaflow classifier. Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2015-05-18 15:14:02 -07:00
Daniele Di Proietto	11e5cf1f90	dpif-netdev: Store batch pointer in dp_netdev_flow. The userspace datapath 1. receives a batch of packets. 2. finds a 'netdev_flow' (megaflow) for each packet. 3. groups the packets in output batches based on the 'netdev_flow'. Until now the grouping (3) was done using a simple algorithm with an O(N^2) runtime, where N is the number of distinct megaflows of the packets in the incoming batch. This could quickly become a bottleneck, even with a small number of megaflows. With this commit the datapath simply stores in the 'netdev_flow' (the megaflow) a pointer to the output batch, if one has been created for the current input batch. The pointer will be cleared when the output batch is sent. In a simple phy2phy test with 128 megaflows the throughput is more than doubled. The reason that stopped us from doing this change was that the 'netdev_flow' memory was shared between multiple threads: this is no longer the case with the per-thread classifier. Also, this commit reorders struct dp_netdev_flow to group together the members used in the fastpath. Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2015-05-18 15:14:02 -07:00
Daniele Di Proietto	efa2bcbb35	dpif-netdev: Store pkt_metadata structure in dp_netdev_port. Initializing a struct pkt_metadata for every packet can be surprisingly expensive. It's much faster to keep a copy for each port and copying it on each packet. Suggested-by: Pravin Shelar <pshelar@nicira.com> Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2015-05-18 15:14:02 -07:00
Daniele Di Proietto	28e2fa027d	dpif-netdev: Batch packets when recirculating. Now that we have per packet metadata, there's no need to split packet batches when recirculating. Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-20 12:56:29 -07:00
Daniele Di Proietto	2bc1bbd27d	dp-packet: Rename 'dp_hash' in 'rss_hash'. We already have the 'dp_hash' embedded in the metadata. This caused confusion in the code. With this commit it should be clear that 'rss_hash' is the packet hash used for internal purposes, while 'md.dp_hash' is part of the flow, computed during the execution of certain actions. Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-20 12:49:41 -07:00
Daniele Di Proietto	11bfdaddf2	dpif-netdev: Cache time_msec() calls for each received batch. Calling time_msec() (which calls clock_gettime()) too often might be expensive. With this commit OVS makes only one call per received batch and caches the result. Suggested-by: Ethan Jackson <ethan@nicira.com> Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-20 12:49:41 -07:00
Daniele Di Proietto	9ff55ae284	dpif-netdev: Store actions data and size contiguously. As stated by the comment above the structure, the 'action' pointer does not change during the 'dp_netdev_actions' lifetime: we might as well embed the pointed memory into the structure. The commit also updates the description of dp_netdev_actions_create(). Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-20 12:49:41 -07:00
Ben Pfaff	17050610ec	dpif-netdev: Reject adding duplicate ports. Otherwise it is at least very confusing. Found during testing. An upcoming commit adds a test. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Daniele Di Proietto <diproiettod@vmware.com>	2015-04-16 08:13:10 -07:00
Daniele Di Proietto	6553d06bd1	dpif-netdev: Add dpif-netdev/pmd-stats-* appctl commands. These commands can be used to get packets and cycles counters on a pmd thread basis. They're useful to get a clearer picture about the performance of the userspace datapath. They export these pieces of information: - A (per-thread) view of the caches hit rate. Hits in the exact match cache are reported separately from hits in the masked classifier - A rough cycles count. This will allow to estimate the load of OVS and the polling overhead. Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-14 12:31:30 -07:00
Daniele Di Proietto	c8973eb634	dpif-provider: Add class init function. This init function is called when the dpif class is registered. It will be used by following commits Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-14 12:30:11 -07:00
Daniele Di Proietto	55e3ca97d1	dpif-netdev: Add simple per pmd-thread cycles counters. The counters use x86 TSC if available (currently only with DPDK). They will be exposed by subsequent commits Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-14 12:28:50 -07:00
Daniele Di Proietto	abcf3ef4c3	dpif-netdev: Count exact match cache hits. We used to count exact match cache hits and masked classifier hits together. This commit splits the DP_STAT_HIT counter into two. This change will be used by future commits. Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-09 15:00:52 -07:00
Daniele Di Proietto	eb94da30ae	dpif-netdev: Make datapath and flow stats atomic. A read operation from a non atomic shared value (without external locking) can return incorrect values. Using the atomic semantics prevents this from happening. However: * No memory barriers are used. We don't need that kind of consistency for statistics (we use relaxed operations). * The updates are not atomic, just the loads and stores. This is ok because there's a single writer. Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-09 15:00:52 -07:00
Daniele Di Proietto	60fc3b7ba4	dpif-netdev: Group statistics updates in the slow path. Since statistics updates might require locking (in future commits) grouping them will reduce the locking overhead. Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-09 15:00:52 -07:00
Daniele Di Proietto	97447f55a9	dpif-netdev: Remove support for DPIF_FP_ZERO_STATS flag Since flow statistics are thread local and updated without any lock, it is not correct to do a memset from another thread. This commit simply removes the support for the flag. It is not needed by ofproto-dpif, it is only exposed by dpctl commands. Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-04-02 17:55:17 -07:00
Daniele Di Proietto	7ad20cbd96	dpif-netdev: Account for and free lost packets. Packets for which an upcall has failed (lost packets) must be deleted. We also need to count them as MISS and LOST. Signed-off-by: Daniele Di Proietto <diproiettod@vmware.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2015-03-30 13:17:41 -07:00
Pravin B Shelar	6fd6ed71cb	ofpbuf: Simplify ofpbuf API. ofpbuf was complicated due to its wide usage across all layers of OVS, Now we have introduced independent dp_packet which can be used for datapath packet, we can simplify ofpbuf. Following patch removes DPDK mbuf and access API of ofpbuf members. Signed-off-by: Pravin B Shelar <pshelar@nicira.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2015-03-03 13:37:39 -08:00
Pravin B Shelar	cf62fa4c70	dp-packet: Remove ofpbuf dependency. Currently dp-packet make use of ofpbuf for managing packet buffers. That complicates ofpbuf, by making dp-packet independent of ofpbuf both libraries can be optimized for their own use case. This avoids mapping operation between ofpbuf and dp_packet in datapath upcalls. Signed-off-by: Pravin B Shelar <pshelar@nicira.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2015-03-03 13:37:37 -08:00
Pravin B Shelar	e14deea0bd	dpif_packet: Rename to dp_packet dp_packet is short and better name for datapath packet structure. Signed-off-by: Pravin B Shelar <pshelar@nicira.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com>	2015-03-03 13:37:34 -08:00
Ethan Jackson	4c75aaabb1	dpif-netdev: Fix rare flow add race condition. Before this patch, dp_netdev_flow_add() inserted newly minted flows in the "flow_table" cmap before inserting them into the per core "dpcls" classifier. Since dpcls_insert() initializes 'flow->cr.mask', there's a brief window where the flow is accessible from the cmap, but has a bogus mask value. In my testing, under rare instances (i.e. once every 20 minutes with a very specific flow table and traffic pattern), revalidators core dump when they call dpif_netdev_flow_dump_next(), which accesses this bogus mask value from dp_netdev_flow_to_dpif_flow(). By inserting into the per core classifier before the cmap, all the values are guaranteed to be initialized during flow dumps. With this patch, I can no longer reproduce the crash. Signed-off-by: Ethan Jackson <ethan@nicira.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com>	2015-01-07 18:15:13 -08:00
Jarno Rajahalme	d70e8c28f9	miniflow: Use 64-bit data. So far the compressed flow data in struct miniflow has been in 32-bit words with a 63-bit map, allowing for a maximum size of struct flow of 252 bytes. With the forthcoming Geneve options this is not sufficient any more. This patch solves the problem by changing the miniflow data to 64-bit words, doubling the flow max size to 504 bytes. Since the word size is doubled, there is some loss in compression efficiency. To counter this some of the flow fields have been reordered to keep related fields together (e.g., the source and destination IP addresses share the same 64-bit word). This change should speed up flow data processing on 64-bit CPUs, which may help counterbalance the impact of making the struct flow bigger in the future. Classifier lookup stage boundaries are also changed to 64-bit alignment, as the current algorithm depends on each miniflow word to not be split between ranges. This has resulted in new padding (part of the 'mpls_lse' field). The 'dp_hash' field is also moved to packet metadata to eliminate otherwise needed padding there. This allows the L4 to fit into one 64-bit word, and also makes matches on 'dp_hash' more efficient as misses can be found already on stage 1. Signed-off-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2015-01-06 14:47:30 -08:00
Jarno Rajahalme	aae7c34f04	hash: Add hash_add64(). Add support for adding 64-bit words to hashes. This will be used by subsequent patches. Signed-off-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2015-01-06 14:47:30 -08:00
Alex Wang	1c1e46ed84	dpif-netdev: Add per-pmd flow-table/classifier. This commit changes the per dpif-netdev datapath flow-table/ classifier to per pmd-thread. As direct benefit, datapath and flow statistics no longer need to be protected by mutex or be declared as per-thread variable, since they are only written by the owning pmd thread. As side effects, the flow-dump output of userspace datapath can contain overlapping flows. To reduce confusion, the dump from different pmd thread will be separated by a title line. In addition, the flow operations via 'ovs-appctl dpctl/*' are modified so that if the given flow in_port corresponds to a dpdk interface, the operation will be conducted to all pmd threads recv from that interface (except for flow-get which will always be applied to non-pmd threads). Signed-off-by: Alex Wang <alexw@nicira.com> Tested-by: Mark D. Gray <mark.d.gray@intel.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2014-12-30 11:47:30 -08:00
Alex Wang	b19befaef2	dpif-netdev: Add function to get pmd using core id. This commit adds the function dp_netdev_get_pmd() which allows users to get 'struct dp_netdev_pmd_thread' based on the core id. Signed-off-by: Alex Wang <alexw@nicira.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2014-12-30 11:47:18 -08:00
Joe Stringer	8e1ffd757d	dpif: Shift ufid support checking up to dpif_backer. Previously, the dpif layer was responsible for determining datapath support for UFIDs, which resulted in all ovs-dpctl utilities inserting/deleting flows from the datapath each time they are run. Shift this responsibility up to the dpif_backer. There are two users of this functionality: Revalidators check for UFID support to request a terser dump using UFIDs, and dpif-netlink uses this to request flow_del operations to only return the UFID/stats. The latter case was previously hidden from revalidators, but this change makes them aware of it, and reuses the same "udpif->enable_ufid" flag for reducing overhead of both flow dump and flow delete. Signed-off-by: Joe Stringer <joestringer@nicira.com> Acked-by: Andy Zhou <azhou@nicira.com>	2014-12-19 12:57:41 -08:00
Thomas Graf	e6211adce4	lib: Move vlog.h to <openvswitch/vlog.h> A new function vlog_insert_module() is introduced to avoid using list_insert() from the vlog.h header. Signed-off-by: Thomas Graf <tgraf@noironetworks.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-12-15 14:15:19 +01:00
Joe Stringer	64bb477f05	dpif: Minimize memory copy for revalidation. One of the limiting factors on the number of flows that can be supported in the datapath is the overhead of assembling flow dump messages in the datapath. This patch modifies the dpif to allow revalidators to skip dumping the key, mask and actions from the datapath, by making use of the unique flow identifiers introduced in earlier patches. For each flow dump, the dpif user specifies whether to skip these attributes, allowing the common case to only dump a pair of 128-bit ID and flow stats. With datapath support, this increases the number of flows that a revalidator can handle per second by 50% or more. Support in dpif-netdev and dpif-netlink is added in this patch; kernel support is left for future patches. Signed-off-by: Joe Stringer <joestringer@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-12-02 14:55:55 -08:00
Joe Stringer	70e5ed6f39	dpif: Index flows using unique identifiers. This patch modifies the dpif interface to allow flows to be manipulated using a 128-bit identifier. This allows revalidator threads to perform datapath operations faster, as they do not need to serialise the entire flow key for operations like flow_get and flow_delete. In conjunction with a future patch to simplify the dump interface, this provides a significant performance benefit for revalidation. When handlers assemble flow_put operations, they specify a unique identifier (UFID) for each flow as it is passed down to the datapath to be stored with the flow. The UFID is currently provided to handlers by the dpif during upcall processing. When revalidators assemble flow_get or flow_del operations, they may specify the UFID for the flow along with the key. The dpif will decide whether to send only the UFID to the datapath, or both the UFID and flow key. The former is preferred for newer datapaths that support UFID, while the latter is used for backwards compatibility. Signed-off-by: Joe Stringer <joestringer@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-12-02 14:10:23 -08:00
Alex Wang	433330a8bf	dpif-netdev: Fix a race. On current master, the 'struct dp_netdev_port' is destroyed immediately when the ref count reaches 0. However, non-pmd threads calling the dpif_netdev_execute() for sending packets could hold pointer to 'port' that is not ref-counted. Thusly those threads could possibly access freed memory when the port is deleted. To fix this bug, this commit makes non-pmd threads acquiring the 'port_mutex' before doing the actual execution in dpif_netdev_execute(). Signed-off-by: Alex Wang <alexw@nicira.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2014-12-01 10:53:19 -08:00
Joe Stringer	7af12bd7c8	dpif: Generate flow_hash for revalidators in dpif. This patch shifts the responsibility for determining the hash for a flow from the revalidation logic down to the dpif layer. This assists in handling backward-compatibility for revalidation with the upcoming unique flow identifier "UFID" patches. A 128-bit UFID was selected to minimize the likelihood of hash conflicts. Handler threads will not install a flow that has an identical UFID as another flow, to prevent misattribution of stats and to ensure that the correct flow key cache is used for revalidation. For datapaths that do not support UFID, which is currently all datapaths, the dpif will generate the UFID and pass it up during upcall and flow_dump. This is generated based on the datapath flow key. Later patches will add support for datapaths to store and interpret this UFID, in which case the dpif has a responsibility to pass it through transparently. Signed-off-by: Joe Stringer <joestringer@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-11-25 14:12:25 -08:00
Pravin B Shelar	53e1d6f1ef	dpif-netdev: Remove redundant hash action handling. odp_execute_actions() already handles hash execution part. Signed-off-by: Pravin B Shelar <pshelar@nicira.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com>	2014-11-21 15:51:42 -08:00
Alex Wang	67ad54cbc8	dpif-netdev: Garbage collect the exact match cache periodically. On current master, the exact match cache entry can keep reference to 'struct dp_netdev_flow' even after the flow is removed from the flow table. This means the free of allocated memory of the flow is delayed until the exact match cache entry is cleared or replaced. If the allocated memory is ahead of chunks of freed memory on heap, the delay will prevent the reclaim of those freed chunks, causing falsely high memory utilization. To fix the issue, this commit makes the owning thread conduct periodic garbage collection on the exact match cache and clear dead entries. Signed-off-by: Alex Wang <alexw@nicira.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com> --- PATCH -> V2: - Adopt Jarno's suggestion and conduct slow sweep to avoid introducing jitter.	2014-11-21 08:06:41 -08:00
Jarno Rajahalme	802f84ffd7	classifier: Defer pvector publication. This patch adds new functions classifier_defer() and classifier_publish(), which control when the classifier modifications are made available to lookups. By default, all modifications are made available to lookups immediately. Modifications made after a classifier_defer() call MAY be 'deferred' for later 'publication'. A call to classifier_publish() will both publish any deferred modifications, and cause subsequent changes to be published immediately. Currently any deferring is limited to the visibility of the subtable vector changes. pvector now processes modifications mostly in a working copy, which needs to be explicitly published with pvector_publish(). pvector_publish() sorts the working copy and removes gaps before publishing it. This change helps avoiding O(n**2) memory behavior in corner cases, where large number of rules with different masks are inserted or deleted. VMware-BZ: #1322017 Signed-off-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-11-14 16:00:46 -08:00
Alex Wang	accf86266a	dpif-netdev: Allow direct destroy of 'struct dp_netdev_port'. Before this commit, when 'struct dp_netdev_port' is deleted from 'dpif-netdev' datapath, if there is pmd thread, the pmd thread will release the last reference to the port and ovs-rcu postpone the destroy. However, the delayed close of object like 'struct netdev' could cause failure in immediate re-add or reconfigure of the same device. To fix the above issue, this commit uses condition variable and makes the main thread wait for pmd thread to release the reference when deleting port. Then, the main thread can directly destroy the port. Reported-by: Cian Ferriter <cian.ferriter@intel.com> Signed-off-by: Alex Wang <alexw@nicira.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2014-11-12 15:59:12 -08:00
Alex Wang	f7d636527b	dpif-netdev: Move 'struct dp_netdev_port' initialization before use. There is a portion of the 'struct dp_netdev_port' initialization that is placed after the reload of pmd threads. This means in theory, there could be a race where pmd threads access half- initialized struct. Although such race has not been seen, it makes sense to fully initialize the struct before use. Found by code inspection. Signed-off-by: Alex Wang <alexw@nicira.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2014-11-12 15:59:11 -08:00
Pravin B Shelar	a36de779d7	openvswitch: Userspace tunneling. Following patch adds support for userspace tunneling. Tunneling needs three more component first is routing table which is configured by caching kernel routes and second is ARP cache which build automatically by snooping arp. And third is tunnel protocol table which list all listening protocols which is populated by vswitchd as tunnel ports are added. GRE and VXLAN protocol support is added in this patch. Tunneling works as follows: On packet receive vswitchd check if this packet is targeted to tunnel port. If it is then vswitchd inserts tunnel pop action which pops header and sends packet to tunnel port. On packet xmit rather than generating Set tunnel action it generate tunnel push action which has tunnel header data. datapath can use tunnel-push action data to generate header for each packet and forward this packet to output port. Since tunnel-push action contains most of packet header vswitchd needs to lookup routing table and arp table to build this action. Signed-off-by: Pravin B Shelar <pshelar@nicira.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Thomas Graf <tgraf@noironetworks.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-11-12 15:08:33 -08:00
Andy Zhou	b5cbbcf656	bridge: Store datapath version into ovsdb OVS userspace are backward compatible with older Linux kernel modules. However, not having the most up-to-date datapath kernel modules can some times lead to user confusion. Storing the datapath version in OVSDB allows management software to check and optionally provide notifications to users. Signed-off-by: Andy Zhou <azhou@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-11-05 15:27:38 -08:00
David Verbeiren	7251515ea9	netdev-dpdk: Fix DPDK rings broken by multi queue DPDK rings don't need one queue per PMD thread and don't support multiple queues (set_multiq function is undefined). To fix operation with DPDK rings, this patch ignores EOPNOTSUPP error on netdev_set_multiq() and provides, for DPDK rings, a netdev send() function that ignores the provided queue id (= PMD thread core id). Suggested-by: Maryam Tahhan <maryam.tahhan@intel.com> Signed-off-by: David Verbeiren <david.verbeiren@intel.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2014-11-04 09:21:27 -08:00
Jarno Rajahalme	caeb4906d4	lib/dpif-netdev: Fix EMC lookup. Patch 0de8783a9 (lib/dpif-netdev: Integrate megaflow classifier.) broke exact match cache lookup, but it went undetected since there are no separate stats for EMC. This patch fixes the problem by changing the struct netdev_flow_key 'len' member to cover only the 'mf' member, not the whole netdev_flow_key, and ignoring the 'len' field in netdev_flow_key_equal. Comparison is still accurate, as the miniflow 'map' field encodes the length in the number of 1-bits, and the map is included in the comparison. Reported-by: Alex Wang <alexw@nicira.com> Signed-off-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Daniele Di Proietto <ddiproietto@vmware.com>	2014-10-17 17:03:13 -07:00
Jarno Rajahalme	0de8783a9d	lib/dpif-netdev: Integrate megaflow classifier. Megaflow inserts and removals are simplified: - No need for classifier internal mutex, as dpif-netdev already has a 'flow_mutex'. - Number of memory allocations/frees can be halved. - Lookup code path can rely on netdev_flow_key always having inline data. This will also be easier to simplify further when moving to per-thread megaflow classifiers in the future. Signed-off-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Alex Wang <alexw@nicira.com>	2014-10-17 09:37:11 -07:00
Pravin B Shelar	41ccaa249c	netdev-dpif: Add metadata to dpif-packet. Today dpif-netdev has a single metadata for a given batch, since one batch belongs to one port, but soon packets from a single tunnel port can belong to different ports, so we need to have per packet metadata. Signed-off-by: Pravin B Shelar <pshelar@nicira.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com>	2014-10-09 14:12:11 -07:00
Daniele Di Proietto	154374a72b	dpif-netdev: reduce netdev_flow_key size Signed-off-by: Daniele Di Proietto <ddiproietto@vmware.com> Signed-off-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com>	2014-10-07 11:23:37 -07:00
Jarno Rajahalme	52a524eb20	lib/cmap: cmap_find_batch(). Batching the cmap find improves the memory behavior with large cmaps and can make searches twice as fast: $ tests/ovstest test-cmap benchmark 2000000 8 0.1 16 Benchmarking with n=2000000, 8 threads, 0.10% mutations, batch size 16: cmap insert: 533 ms cmap iterate: 57 ms batch search: 146 ms cmap destroy: 233 ms cmap insert: 552 ms cmap iterate: 56 ms cmap search: 299 ms cmap destroy: 229 ms hmap insert: 222 ms hmap iterate: 198 ms hmap search: 2061 ms hmap destroy: 209 ms Batch size 1 has small performance penalty, but all other batch sizes are faster than non-batched cmap_find(). The batch size 16 was experimentally found better than 8 or 32, so now classifier_lookup_miniflow_batch() performs the cmap find operations in batches of 16. Signed-off-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-10-06 15:33:47 -07:00
Jarno Rajahalme	55847abee8	lib/cmap: split up cmap_find(). This makes the following patch easier and cleans up the code. Explicit "inline" keywords seem necessary to prevent performance regression on cmap_find() with GCC 4.7 -O2. Signed-off-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-10-06 15:33:46 -07:00
Daniele Di Proietto	8bd89cdc06	dpif-netdev: Destroy pmd_thread cmap at exit Found by valgrind Signed-off-by: Daniele Di Proietto <ddiproietto@vmware.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com>	2014-10-03 15:04:15 -07:00
Daniele Di Proietto	88ace79b3e	dpif-netdev: fix dp_netdev_free() dp_netdev_free() must free 'dp->upcall_rwlock', but when upcalls are disabled (if the datapath is being freed upcalls should be disabled) 'dp->upcall_rwlock' is taken and freeing it causes an assertion to fail. This commit makes sure that the upcalls are disabled and releases 'dp->upcall_rwlock' before freeing it. A simple testcase is added to detect the failure. Signed-off-by: Daniele Di Proietto <ddiproietto@vmware.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com>	2014-10-03 15:04:15 -07:00
Daniele Di Proietto	ac8c20812b	dpif-netdev: Fix (packet) memory leaks in the slow path. If a packet didn't match a rule in the fast path classifier its memory was never freed. The issue was particularly clear with DPDK devices because it was not possible to process more than ~250000 DPDK mbufs in the slow path. This commit fixes the problem by: * calling dpif_packet_delete() if the upcalls are disabled * passing may_steal==true to dp_netdev_execute_actions() during normal upcall processing Signed-off-by: Daniele Di Proietto <ddiproietto@vmware.com> Acked-by: Alex Wang <alexw@nicira.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2014-09-19 16:50:17 -07:00
Alex Wang	f2eee18911	dpif-netdev: Allow multi-rx-queue, multi-pmd-thread configuration. This commits adds the multithreading functionality to OVS dpdk module. Users are able to create multiple pmd threads and set their cpu affinity via specifying the cpu mask string similar to the EAL '-c COREMASK' option. Also, the number of rx queues for each dpdk interface is made configurable to help distribution of rx packets among multiple pmd threads. Signed-off-by: Alex Wang <alexw@nicira.com> Acked-by: Pravin B Shelar <pshelar@nicira.com>	2014-09-19 15:59:36 -07:00
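
The grouping idea described in commit 11e5cf1f90 (store a pointer to the output batch in the megaflow, clear it when the batch is sent) can be sketched roughly as below. This is only an illustration under stated assumptions, not the OVS implementation: `struct flow`, `struct batch`, `group_packets()`, `flush_batches()` and `MAX_BURST` are hypothetical stand-ins, and packets are represented by their index in the input batch.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_BURST 32            /* Hypothetical input batch limit. */

struct batch;

/* Hypothetical stand-in for a megaflow ('netdev_flow' in the commit text). */
struct flow {
    int id;
    struct batch *batch;        /* Output batch for the current input batch,
                                 * or NULL if none has been created yet. */
};

/* Hypothetical output batch: packets that matched the same flow. */
struct batch {
    struct flow *flow;
    int packets[MAX_BURST];     /* Indexes into the input batch. */
    size_t n;
};

/* Groups 'n_pkts' packets by flow in a single pass.  'flows[i]' is the flow
 * matched by packet 'i'.  Each packet costs one pointer check instead of a
 * search through the batches created so far.  Returns the number of output
 * batches filled in 'batches'. */
static size_t
group_packets(struct flow *flows[], size_t n_pkts, struct batch batches[])
{
    size_t n_batches = 0;

    assert(n_pkts <= MAX_BURST);
    for (size_t i = 0; i < n_pkts; i++) {
        struct flow *f = flows[i];

        if (!f->batch) {
            /* First packet of this flow in the current input batch. */
            struct batch *b = &batches[n_batches++];

            b->flow = f;
            b->n = 0;
            f->batch = b;
        }
        f->batch->packets[f->batch->n++] = (int) i;
    }
    return n_batches;
}

/* "Sends" the output batches; here this only clears the cached pointers so
 * that the next input batch starts from a clean state. */
static void
flush_batches(struct batch batches[], size_t n_batches)
{
    for (size_t i = 0; i < n_batches; i++) {
        batches[i].flow->batch = NULL;
    }
}
```

The cached pointer is only safe because, as the commit message notes, the flow memory is no longer shared between threads once the classifier is per-thread.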
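
Commit eb94da30ae describes statistics that are read and written with relaxed atomic loads and stores, with non-atomic updates that are acceptable because there is a single writer. A minimal sketch of that pattern follows. It assumes plain C11 `<stdatomic.h>` as a stand-in for whatever atomic wrappers the tree uses, and `struct flow_stats` and the `stats_*()` helpers are hypothetical names chosen for illustration.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical per-flow counters, written by one thread, read by others. */
struct flow_stats {
    _Atomic uint64_t packets;
    _Atomic uint64_t bytes;
};

static void
stats_init(struct flow_stats *s)
{
    assert(s != NULL);
    atomic_init(&s->packets, 0);
    atomic_init(&s->bytes, 0);
}

/* Writer side.  The load/add/store sequence is not an atomic read-modify-
 * write, which is fine only because a single thread ever calls this for a
 * given 's'.  Relaxed ordering: no memory barriers are requested. */
static void
stats_add(struct flow_stats *s, uint64_t n_packets, uint64_t n_bytes)
{
    uint64_t v;

    assert(s != NULL);
    v = atomic_load_explicit(&s->packets, memory_order_relaxed);
    atomic_store_explicit(&s->packets, v + n_packets, memory_order_relaxed);

    v = atomic_load_explicit(&s->bytes, memory_order_relaxed);
    atomic_store_explicit(&s->bytes, v + n_bytes, memory_order_relaxed);
}

/* Reader side.  Each load returns a whole value, never a torn one, but
 * 'packets' and 'bytes' may be observed at slightly different points in
 * time. */
static uint64_t
stats_read_packets(struct flow_stats *s)
{
    return atomic_load_explicit(&s->packets, memory_order_relaxed);
}

static uint64_t
stats_read_bytes(struct flow_stats *s)
{
    return atomic_load_explicit(&s->bytes, memory_order_relaxed);
}
```

With more than one writer the load/store pair in `stats_add()` would lose updates; an atomic fetch-and-add would be needed in that case.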
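
The size figures in commit d70e8c28f9 (a 63-bit map with 32-bit words gives at most 252 bytes, with 64-bit words 504 bytes) and the remark in commit caeb4906d4 that the number of 1-bits in the map encodes the data length can be checked with a small worked example. The helper names `popcount64()` and `data_bytes()` and the `MAP_BITS` constant are hypothetical and introduced only for this sketch.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define MAP_BITS 63             /* Map width quoted in the commit message. */

/* Counts the 1-bits in 'map' by clearing the lowest set bit each round. */
static unsigned int
popcount64(uint64_t map)
{
    unsigned int n = 0;

    while (map) {
        map &= map - 1;
        n++;
    }
    return n;
}

/* Bytes of compressed data implied by 'map' when every 1-bit stands for one
 * data word of 'word_size' bytes. */
static size_t
data_bytes(uint64_t map, size_t word_size)
{
    assert((map >> MAP_BITS) == 0);     /* Only 63 bits may be used. */
    return popcount64(map) * word_size;
}
```

With all 63 bits set, `data_bytes()` yields 63 * 4 = 252 for 32-bit words and 63 * 8 = 504 for 64-bit words, matching the limits stated in the commit message.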

350 Commits