2019-06-25 11:12:58 +00:00
|
|
|
.. SPDX-License-Identifier: BSD-3-Clause
|
|
|
|
Copyright(c) 2015-2016 Intel Corporation.
|
2017-04-21 10:43:26 +00:00
|
|
|
|
|
|
|
Keep Alive Sample Application
|
|
|
|
=============================
|
|
|
|
|
|
|
|
The Keep Alive application is a simple example of a
|
|
|
|
heartbeat/watchdog for packet processing cores. It demonstrates how
|
|
|
|
to detect 'failed' DPDK cores and notify a fault management entity
|
|
|
|
of this failure. Its purpose is to ensure the failure of the core
|
|
|
|
does not result in a fault that is not detectable by a management
|
|
|
|
entity.
|
|
|
|
|
|
|
|
|
|
|
|
Overview
|
|
|
|
--------
|
|
|
|
|
|
|
|
The application demonstrates how to protect against 'silent outages'
|
2021-02-05 08:48:47 +00:00
|
|
|
on packet processing cores. A Keep Alive Monitor Agent Core (main)
|
2017-04-21 10:43:26 +00:00
|
|
|
monitors the state of packet processing cores (worker cores) by
|
|
|
|
dispatching pings at a regular time interval (default is 5ms) and
|
|
|
|
monitoring the state of the cores. Cores states are: Alive, MIA, Dead
|
|
|
|
or Buried. MIA indicates a missed ping, and Dead indicates two missed
|
|
|
|
pings within the specified time interval. When a core is Dead, a
|
|
|
|
callback function is invoked to restart the packet processing core;
|
|
|
|
A real life application might use this callback function to notify a
|
|
|
|
higher level fault management entity of the core failure in order to
|
|
|
|
take the appropriate corrective action.
|
|
|
|
|
|
|
|
Note: Only the worker cores are monitored. A local (on the host) mechanism
|
|
|
|
or agent to supervise the Keep Alive Monitor Agent Core DPDK core is required
|
|
|
|
to detect its failure.
|
|
|
|
|
|
|
|
Note: This application is based on the :doc:`l2_forward_real_virtual`. As
|
|
|
|
such, the initialization and run-time paths are very similar to those
|
|
|
|
of the L2 forwarding application.
|
|
|
|
|
|
|
|
Compiling the Application
|
|
|
|
-------------------------
|
|
|
|
|
2018-05-15 09:49:22 +00:00
|
|
|
To compile the sample application see :doc:`compiling`.
|
2017-04-21 10:43:26 +00:00
|
|
|
|
2018-05-15 09:49:22 +00:00
|
|
|
The application is located in the ``l2fwd_keep_alive`` sub-directory.
|
2017-04-21 10:43:26 +00:00
|
|
|
|
|
|
|
Running the Application
|
|
|
|
-----------------------
|
|
|
|
|
|
|
|
The application has a number of command line options:
|
|
|
|
|
|
|
|
.. code-block:: console
|
|
|
|
|
2021-02-05 08:48:47 +00:00
|
|
|
./<build_dir>/examples/dpdk-l2fwd-keepalive [EAL options] \
|
2017-04-21 10:43:26 +00:00
|
|
|
-- -p PORTMASK [-q NQ] [-K PERIOD] [-T PERIOD]
|
|
|
|
|
|
|
|
where,
|
|
|
|
|
|
|
|
* ``p PORTMASK``: A hexadecimal bitmask of the ports to configure
|
|
|
|
|
|
|
|
* ``q NQ``: A number of queues (=ports) per lcore (default is 1)
|
|
|
|
|
|
|
|
* ``K PERIOD``: Heartbeat check period in ms(5ms default; 86400 max)
|
|
|
|
|
|
|
|
* ``T PERIOD``: statistics will be refreshed each PERIOD seconds (0 to
|
|
|
|
disable, 10 default, 86400 maximum).
|
|
|
|
|
2020-06-18 16:55:50 +00:00
|
|
|
To run the application in linux environment with 4 lcores, 16 ports
|
2017-04-21 10:43:26 +00:00
|
|
|
8 RX queues per lcore and a ping interval of 10ms, issue the command:
|
|
|
|
|
|
|
|
.. code-block:: console
|
|
|
|
|
2021-02-05 08:48:47 +00:00
|
|
|
./<build_dir>/examples/dpdk-l2fwd-keepalive -l 0-3 -n 4 -- -q 8 -p ffff -K 10
|
2017-04-21 10:43:26 +00:00
|
|
|
|
|
|
|
Refer to the *DPDK Getting Started Guide* for general information on
|
|
|
|
running applications and the Environment Abstraction Layer (EAL)
|
|
|
|
options.
|
|
|
|
|
|
|
|
|
|
|
|
Explanation
|
|
|
|
-----------
|
|
|
|
|
2022-09-02 04:40:05 +00:00
|
|
|
The following sections provide some explanation of the
|
2017-04-21 10:43:26 +00:00
|
|
|
Keep-Alive/'Liveliness' conceptual scheme. As mentioned in the
|
|
|
|
overview section, the initialization and run-time paths are very
|
|
|
|
similar to those of the :doc:`l2_forward_real_virtual`.
|
|
|
|
|
|
|
|
The Keep-Alive/'Liveliness' conceptual scheme:
|
|
|
|
|
|
|
|
* A Keep- Alive Agent Runs every N Milliseconds.
|
|
|
|
|
|
|
|
* DPDK Cores respond to the keep-alive agent.
|
|
|
|
|
|
|
|
* If keep-alive agent detects time-outs, it notifies the
|
|
|
|
fault management entity through a callback function.
|
|
|
|
|
|
|
|
The following sections provide some explanation of the code aspects
|
|
|
|
that are specific to the Keep Alive sample application.
|
|
|
|
|
|
|
|
The keepalive functionality is initialized with a struct
|
|
|
|
rte_keepalive and the callback function to invoke in the
|
|
|
|
case of a timeout.
|
|
|
|
|
|
|
|
.. code-block:: c
|
|
|
|
|
|
|
|
rte_global_keepalive_info = rte_keepalive_create(&dead_core, NULL);
|
|
|
|
if (rte_global_keepalive_info == NULL)
|
|
|
|
rte_exit(EXIT_FAILURE, "keepalive_create() failed");
|
|
|
|
|
|
|
|
The function that issues the pings keepalive_dispatch_pings()
|
|
|
|
is configured to run every check_period milliseconds.
|
|
|
|
|
|
|
|
.. code-block:: c
|
|
|
|
|
|
|
|
if (rte_timer_reset(&hb_timer,
|
|
|
|
(check_period * rte_get_timer_hz()) / 1000,
|
|
|
|
PERIODICAL,
|
|
|
|
rte_lcore_id(),
|
|
|
|
&rte_keepalive_dispatch_pings,
|
|
|
|
rte_global_keepalive_info
|
|
|
|
) != 0 )
|
|
|
|
rte_exit(EXIT_FAILURE, "Keepalive setup failure.\n");
|
|
|
|
|
|
|
|
The rest of the initialization and run-time path follows
|
2019-06-25 11:12:58 +00:00
|
|
|
the same paths as the L2 forwarding application. The only
|
2017-04-21 10:43:26 +00:00
|
|
|
addition to the main processing loop is the mark alive
|
|
|
|
functionality and the example random failures.
|
|
|
|
|
|
|
|
.. code-block:: c
|
|
|
|
|
|
|
|
rte_keepalive_mark_alive(&rte_global_keepalive_info);
|
|
|
|
cur_tsc = rte_rdtsc();
|
|
|
|
|
|
|
|
/* Die randomly within 7 secs for demo purposes.. */
|
|
|
|
if (cur_tsc - tsc_initial > tsc_lifetime)
|
|
|
|
break;
|
|
|
|
|
|
|
|
The rte_keepalive_mark_alive function simply sets the core state to alive.
|
|
|
|
|
|
|
|
.. code-block:: c
|
|
|
|
|
|
|
|
static inline void
|
|
|
|
rte_keepalive_mark_alive(struct rte_keepalive *keepcfg)
|
|
|
|
{
|
2018-05-15 09:49:22 +00:00
|
|
|
keepcfg->live_data[rte_lcore_id()].core_state = RTE_KA_STATE_ALIVE;
|
2017-04-21 10:43:26 +00:00
|
|
|
}
|