mir/ovs - ovs - Mike's Git repositories

mir/ovs

mirror of https://github.com/openvswitch/ovs synced 2025-08-29 13:27:59 +00:00

Author	SHA1	Message	Date
Russell Bryant	ee5315b9f9	timeval: Remove duplicate memset(). init_clock begins with a memset of 0 of the full clock struct. This memset at the end of a single struct member just makes extra sure that it's set to 0, which is unnecessary. Signed-off-by: Russell Bryant <rbryant@redhat.com> Signed-off-by: Ben Pfaff <blp@nicira.com>	2015-02-20 10:31:23 -08:00
Ben Pfaff	653ce9d684	timeval: Correctly report usage statistics in log_poll_interval(). Most of the information that timeval was reporting for long poll intervals was comparing per-thread with per-process statistics, which yielded nonsense a lot of the time. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Alex Wang <alexw@nicira.com>	2015-02-11 15:00:48 -08:00
Thomas Graf	e6211adce4	lib: Move vlog.h to <openvswitch/vlog.h> A new function vlog_insert_module() is introduced to avoid using list_insert() from the vlog.h header. Signed-off-by: Thomas Graf <tgraf@noironetworks.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-12-15 14:15:19 +01:00
Daniele Di Proietto	33c24cf9f4	timeval: Fix seq memory leak 'timewarp_seq' should be initialized only once, while init_clock() is called multiple times (once for each clock instance). Found by valgrind Signed-off-by: Daniele Di Proietto <ddiproietto@vmware.com> Acked-by: Jarno Rajahalme <jrajahalme@nicira.com>	2014-10-03 15:04:15 -07:00
Alex Wang	74467d5c88	unixctl: Make command description all lowercase. Signed-off-by: Alex Wang <alexw@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-09-11 13:49:13 -07:00
Jarno Rajahalme	fc4a1204b7	lib/timeval: Use relaxed atomics also when writing on 'slow_path'. Signed-off-by: Jarno Rajahalme <jrajahalme@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-08-29 10:34:53 -07:00
Gurucharan Shetty	4537a423fb	timeval: Initialize 'unix_epoch' for Windows correctly. Till now, we were initializing 'unix_epoch' through time_init(). But if there was a call directly to xgettimeofday(), we would miss the initialization causing overflows. This commit fixes it by pre-calculating the value and assigning it globally. Also add-in a missing return statement. Reported-by: Alessandro Pilotti <apilotti@cloudbasesolutions.com> Reported-by: Linda Sun <lsun@vmware.com> Signed-off-by: Gurucharan Shetty <gshetty@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-07-09 16:30:35 -07:00
Gurucharan Shetty	8661af7986	timeval: Provide a variation for time/warp command. The new command is of the form 'time/warp LARGE_MSECS MSECS'. It advances the current monotonic time by LARGE_MSECS. This is done MSECS at a time in each run of the main thread. This gives other threads time to run after the clock has been advanced by MSECS. The old command would continue to work. Rationale: On Windows, process creation is slower. When we have tests that run 'ovs-appctl time/warp MSECS' hundreds of times in a for loop, the time it takes to complete the test increases. This is specially true for bfd tests. For e.g, the 11 bfd tests would take 3.5 minutes to complete before this change and now takes a little less than 2 minutes. Signed-off-by: Gurucharan Shetty <gshetty@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-06-13 11:31:17 -07:00
Gurucharan Shetty	2b89438deb	timeval: Workaround strftime bug in VS 2013. Visual Studio 2013's behavior is to crash when 0 is passed as second argument to strftime. Signed-off-by: Gurucharan Shetty <gshetty@nicira.com>	2014-05-14 14:45:49 -07:00
Ben Pfaff	3308c69622	timeval: Preserve quiescence across time_poll(). Otherwise ovsrcu_synchronize() busy-waits in its loop because its poll_block() un-quiesces, causing the global_seqno to increase, which is what it waits for. Reported-by: Alex Wang <alexw@nicira.com> Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Alex Wang <alexw@nicira.com>	2014-04-28 15:25:48 -07:00
Ben Pfaff	0f2ea84841	ovs-rcu: New library. RCU allows multiple threads to read objects in parallel without any performance penalty. The following commit will introduce the first use. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Andy Zhou <azhou@nicira.com>	2014-03-18 16:34:28 -07:00
Gurucharan Shetty	eb1ddf9852	timeval: Sleep instead of poll(). The WSAPoll() function, which is similar to poll() doesnot simply sleep when the fd array is NULL. So use Sleep() instead. Signed-off-by: Gurucharan Shetty <gshetty@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-03-13 09:18:17 -07:00
Gurucharan Shetty	bae94bc773	timeval: gettimeofday() for Windows. Use GetSystemTimePreciseAsFileTime() for gettimeofday(). GetSystemTimePreciseAsFileTime() provides the result that is more high resolution than just the microsecond that gittimeofday() in Linux provides. So we need to remove some additional precision. Signed-off-by: Gurucharan Shetty <gshetty@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-03-05 08:38:18 -08:00
Gurucharan Shetty	48f8613f26	timeval: clock_gettime() for Windows. QueryPerformanceCounter() retrieves the current value of the performance counter, which is a high resolution (<1us) time stamp that can be used for time-interval measurements. So, use it for MONOTONIC clock. The GetSystemTimePreciseAsFileTime() function retrieves the current system date and time with the highest possible level of precision (<1us). Use it for real time clock. This function returns a counter representing the number of 100-nanosecond intervals since January 1, 1601. To make it compatible with Linux CLOCK_REALTIME, we need to calculate the 100-nanoseconds counter value till 01/01/1970. An upcoming commit implements gettimeofday() using the same clock, so, carve out a function. Signed-off-by: Gurucharan Shetty <gshetty@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-03-05 08:38:18 -08:00
Gurucharan Shetty	84a6cbae36	fatal-signal: Fatal signal handling for Windows. Windows does not have a SIGHUP or SIGALRM. It does have a SIGINT and SIGTERM. The documentation at msdn says that SIGINT is not supported for win32 applications because WIN32 operating systems generate a new thread to specifically handle Ctrl+C. This commit handles SIGTERM for Windows. The documentation also states that nothing generates SIGTERM in Windows, but one can use raise(SIGTERM) to manage it. The idea for handling SIGTERM for Windows is to just have a place holder if there is need to raise() a signal for some other purpose. We use SIGALRM in timeval.c if we wake up from a sleep after 'deadline'. For Windows, print an error message and then use SIGTERM. There is an atexit() function for Windows, so we can call cleanup functions during exit. An upcoming commit separately handles Ctrl+C so that we can call clean up functions for that use case. Signed-off-by: Gurucharan Shetty <gshetty@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com>	2014-02-26 12:34:28 -08:00
Linda Sun	4ca828d713	poll-loop: Port to Windows. Use WaitForMultipleObjects for polling on windows. This works on all kinds of objects, e.g. sockets, files, especially ioctl calls to the kernel. poll_fd_wait_event() is used if events need to be passed to pollfds. latch is signaled with event, to be waited/polled by WaitForMultipleObjects() as well. Changed array of fds to hmap to check for duplicate fds. Signed-off-by: Linda Sun <lsun@vmware.com> Signed-off-by: Ben Pfaff <blp@nicira.com>	2014-01-17 16:11:20 -08:00
YAMAMOTO Takashi	ac3c0be096	timeval: Workaround for threaded test failures BFD tests have the code like the following. # wait for a while to stablize everything. for i in `seq 0 9`; do ovs-appctl time/warp 500; done They no longer work as intended because BFD code is run in a separate monitor thread these days. The loop merely "warp" the time by 5000. The monitor thread should have been woken at least once, but it's far from "wait for a while to stablize everything." This commit mitigates the problem by sleeping a little in the appctl handler. This is not ideal but makes BFD tests success on my environment. Signed-off-by: YAMAMOTO Takashi <yamamoto@valinux.co.jp> Signed-off-by: Ben Pfaff <blp@nicira.com>	2013-12-20 07:56:49 -08:00
Ben Pfaff	e2afa7cd68	timeval: Call coverage_clear() before coverage_log() in time_poll(). time_poll() calls log_poll_interval(), which in some circumstances calls coverage_log(). Before this commit, time_poll() also called coverage_clear() after log_poll_interval(). This made sense before commit 857165b5fd26 (coverage: Make thread-safe.), because coverage_log() would log the most recent main loop's coverage counters separately and calling coverage_clear() beforehand would zero out those counters. However, it doesn't make sense any longer because the most recent loop's counters are no longer separately logged and in fact this practice now means that the most recent loop's counters are omitted from the logged counters. Therefore, this commit moves the call to coverage_clear() earlier, so that the most recent loop's counters are included. Signed-off-by: Ben Pfaff <blp@nicira.com>	2013-10-09 17:02:57 -07:00
Alex Wang	53d98a1e9c	timeval: Wake up all threads when time is warped. This commit makes the main thread wake up all other threads when time is warped. Signed-off-by: Alex Wang <alexw@nicira.com> Signed-off-by: Ethan Jackson <ethan@nicira.com> Acked-by: Ben Pfaff <blp@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-10-08 17:57:58 -07:00
Alex Wang	98cf638b19	coverage: Reimplement the "ovs-appctl coverage/show" command. This commit changes the "ovs-appctl coverage/show" command to show the the averaged per-second rates for the last few seconds, the last minute and the last hour, and the total counts of all of the coverage counters. Signed-off-by: Alex Wang <alexw@nicira.com> Signed-off-by: Ethan Jackson <ethan@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-10-01 13:08:46 -07:00
Ben Pfaff	e6249c68a5	timeval: Replace rwlock by mutex. It's only held briefly now and in general a mutex tends to be preferred for that case. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-09-13 14:30:33 -07:00
Ben Pfaff	04b331cdca	timeval: Restore ability to warp time forward when time is not stopped. Commit 31ef9f5178dee18 (timeval: Remove CACHE_TIME scheme.) inadvertently removed the ability to warp time forward, for use in tests, except when time is stopped. This fixes the problem. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-09-13 14:30:05 -07:00
Ben Pfaff	3e509ec5ed	timeval: Add Clang thread-safety annotations, fix unimportant violation. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-09-13 13:34:17 -07:00
Paul Ingram	2b31d8e713	vlog: Report timestamps in millisecond resolution in log messages. To make debugging easier. Signed-off-by: Ben Pfaff <blp@nicira.com> Signed-off-by: Paul Ingram <pingram@nicira.com>	2013-09-13 09:11:46 -07:00
YAMAMOTO Takashi	223fad326c	lib/timeval: don't forget to initialize a rwlock Commit 31ef9f5178 (timeval: Remove CACHE_TIME scheme.) removed initialization of a rwlock which is still used for some operations. This restores it. Signed-off-by: Ben Pfaff <blp@nicira.com>	2013-09-09 09:53:26 -07:00
Alex Wang	31ef9f5178	timeval: Remove CACHE_TIME scheme. This commit removes the CACHE_TIME scheme from timeval module. This is for eliminating the lock contention over the read/write lock of the cached time. To get the time, the thread now will directly do the system call 'clock_gettime()'. As a side effect, timer can only be warpped after timer is stopped by 'appctl time/stop' command. Signed-off-by: Alex Wang <alexw@nicira.com> Signed-off-by: Ethan Jackson <ethan@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-09-04 16:57:40 -07:00
Ben Pfaff	2ba4f163d9	ovs-thread: Add support for globally visible per-thread data. DEFINE_PER_THREAD_DATA always declared its data item as "static", meaning that it was only directly visible within a single translation unit. This commit adds additional forms of per-thread data that allow the data to be accessible from multiple translation units. Signed-off-by: Ben Pfaff <blp@nicira.com>	2013-08-08 13:18:10 -07:00
Ethan Jackson	97be153858	clang: Add annotations for thread safety check. This commit adds annotations for thread safety check. And the check can be conducted by using -Wthread-safety flag in clang. Co-authored-by: Alex Wang <alexw@nicira.com> Signed-off-by: Alex Wang <alexw@nicira.com> Signed-off-by: Ethan Jackson <ethan@nicira.com> Signed-off-by: Ben Pfaff <blp@nicira.com>	2013-07-30 21:30:45 -07:00
Ben Pfaff	4c694ff745	timeval: Make CPU usage and wakeup tracking per-thread. This should make the "timeval" module thread-safe, except for its calls into vlog (because vlog is not yet thread-safe). Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ed Maste <emaste@freebsd.org>	2013-07-12 13:52:29 -07:00
Ben Pfaff	75f9c912f2	timeval: Make reading the current time thread-safe. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ed Maste <emaste@freebsd.org>	2013-07-12 13:47:10 -07:00
Ben Pfaff	6f2302b06c	timeval: Fix typo in comment. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ed Maste <emaste@freebsd.org>	2013-07-11 17:04:52 -07:00
Ben Pfaff	94f3e5269e	timeval: Remove backtrace feature. The backtrace feature of timeval is useful because it provides a "poor man's profile" view of Open vSwitch. But it is not likely to be useful in a multithreaded process, because signal delivery doesn't necessarily follow the profile when there is more than one thread. (A signal in a multithreaded process are delivered to an arbitrary thread.) Another problem with the backtrace feature is that it is difficult for format_backtraces() to synchronize properly with the signal handler in a multithreaded process. In a single-threaded process, it can just block the signal handler, but in a multithreaded process this does not prevent signal delivery to threads other than the one running format_backtrace(). Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ed Maste <emaste@freebsd.org>	2013-07-11 17:02:12 -07:00
Ben Pfaff	bc7ad7d474	timeval: New function xclock_gettime(). Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ed Maste <emaste@freebsd.org>	2013-07-11 16:50:15 -07:00
Ben Pfaff	10a89ef04d	Replace all uses of strerror() by ovs_strerror(), for thread safety. Signed-off-by: Ben Pfaff <blp@nicira.com>	2013-06-28 16:09:38 -07:00
Ben Pfaff	728a8b141f	ovs-thread: Add support for various thread-related assertions. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-06-28 16:09:37 -07:00
Ben Pfaff	5ad72ef5b6	timeval: Remove time_disable_restart(), time_enable_restart() functions. These functions will not have the same useful effect when Open vSwitch becomes multithreaded, because time_disable_restart() will disable time advancing for every thread, not just for the thread that calls it. These functions are no longer used, so this commit removes them. Signed-off-by: Ben Pfaff <blp@nicira.com>	2013-06-06 16:59:51 -07:00
Ben Pfaff	a19b84f4a6	timeval: Do not block SIGALRM around setting 'deadline' in time_alarm(). There is no need to do so because the signal handler does not read or write 'deadline'. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-06-05 14:58:01 -07:00
Ben Pfaff	92829687b4	Use pthread_sigmask() in place of sigprocmask(), for thread safety. POSIX says that multithreaded programs must not use sigprocmask() but must use pthread_sigmask() instead. This commit makes that replacement. The actual use of signals in Open vSwitch is still not thread safe following this commit, but this change is a necessary prerequisite for fixing the other problems. Signed-off-by: Ben Pfaff <blp@nicira.com>	2013-05-02 13:55:45 -07:00
Ben Pfaff	93456ec8b0	timeval: Check for HAVE_BACKTRACE instead of HAVE_EXECINFO_H. Other code in the tree uses HAVE_BACKTRACE and then blindly includes <execinfo.h> if it is present, so this doesn't make anything worse. Once we do that, HAVE_EXECINFO_H has no further users, so this commit also removes the check for <execinfo.h> Reported-by: YAMAMOTO Takashi <yamt@mwd.biglobe.ne.jp> Signed-off-by: Ben Pfaff <blp@nicira.com>	2013-04-24 08:59:11 -07:00
Ben Pfaff	4906f197e9	timeval: Avoid backtrace() from signal handler on x86-64. backtrace() is really useful, but it is not signal safe everywhere. We need to reassess whether it is reasonable to use it anywhere, but immediately we need to disable it on x86-64 (with glibc) because it is causing segfaults in testing. Bug #15497. Reported-by: Ram Jothikumar <rjothikumar@vmware.com> Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-03-07 17:13:49 -08:00
Ben Pfaff	6e97c6506e	timeval: Don't issue poll interval warnings when we warp time. It's not meaningful in that case. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-02-11 13:22:30 -08:00
Ben Pfaff	75b0b752d8	timeval: Avoid unnecessary integer overflow in time_alarm(). Durations longer than 4294967 seconds would unnecessarily overflow in the multiplication here. Found by Coverity. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-02-01 14:27:20 -08:00
Ben Pfaff	cb22974d77	Replace most uses of assert by ovs_assert. This is a straight search-and-replace, except that I also removed #include <assert.h> from each file where there were no assert calls left. Signed-off-by: Ben Pfaff <blp@nicira.com> Acked-by: Ethan Jackson <ethan@nicira.com>	2013-01-16 16:03:37 -08:00
Ethan Jackson	543a550837	timeval: Fix occasional backtrace() related deadlock. Occasionally, backtrace() will deadlock in the signal handler because it does some non signal safe initialization. Specifically, it opens a shared object. As a work around, this patch forces backtrace() to run outside of a signal handler, so that future calls will perform as expected. Signed-off-by: Ethan Jackson <ethan@nicira.com>	2012-10-26 14:44:33 -07:00
Ethan Jackson	f43e80e023	poll-loop: Log backtraces when CPU usage is high. Often when debugging Open vSwitch, one will see in the logs that CPU usage has been high for some period of time, but it's totally unclear why. In an attempt to remedy the situation, this patch logs backtraces taken at regular intervals as a poor man's profiling alternative. Signed-off-by: Ethan Jackson <ethan@nicira.com>	2012-10-25 11:14:07 -07:00
Ethan Jackson	7d1563e961	timeval: Coalesce backtraces with counts. With this patch, `ovs-appctl backtrace` will return a unique list of backtraces and a count of how many times it has been recorded. This work had previously been done by ovs-parse-backtrace. However, in future patches poll-loop will begin logging backtraces as a matter of course. At this point, coalescing the backtraces will help keep these log messages brief. Signed-off-by: Ethan Jackson <ethan@nicira.com>	2012-10-25 11:14:07 -07:00
Ethan Jackson	fdce775cc5	timeval: Take a backtrace on each SIGALRM. With this patch, timeval will take a backtrace with each SIGALRM allowing it to retrieve a profiling snapshot instantly. This will be useful in future patches when backtraces are logged. Signed-off-by: Ethan Jackson <ethan@nicira.com>	2012-10-25 09:41:37 -07:00
Ethan Jackson	d7a291ac1c	timeval: Simplify poll interval logging. log_poll_interval() is a little bit too aggressive, and is therefore less useful than it could be. This patch removes the mean interval calculation, and simply logs if the poll loop took longer than 1 second instead. Signed-off-by: Ethan Jackson <ethan@nicira.com>	2012-10-20 17:26:38 -07:00
Ethan Jackson	61a7b1e6fe	timeval: Block SIGALRM when sleeping. Commit 00a16895 (timeval: Don't require signals for time_alarm().) Incorrectly disabled signals when when CACHE_TIME was disabled. In fact, the reverse was correct. As a result of this bug, OVS would wake once every 100ms unnecessarily. It shouldn't have affected correctness otherwise. Signed-off-by: Ethan Jackson <ethan@nicira.com>	2012-10-16 15:02:45 -07:00
Ethan Jackson	9d232a6d57	timeval: Add new "backtrace" appctl command. Often, it can be quite difficult to debug performance issues in Open vSwitch. Typically one needs to run something like gprof, but that requires rebuilding and installing on the affected system which is often problematic. This patch adds a light weight profiling solution which can be used in these situations. The ovs-appctl backtrace command prints out backtraces taken at 100 millisecond intervals over a 5 second period of time. It is currently only supported on systems which have the execinfo library and enable time caching. Signed-off-by: Ethan Jackson <ethan@nicira.com>	2012-10-14 19:36:39 -07:00

1 2

83 Commits