mirror of https://github.com/F-Stack/f-stack.git
251 lines
9.1 KiB
ReStructuredText
251 lines
9.1 KiB
ReStructuredText
.. BSD LICENSE
|
||
Copyright(c) 2015 Netronome Systems, Inc. All rights reserved.
|
||
All rights reserved.
|
||
|
||
Redistribution and use in source and binary forms, with or without
|
||
modification, are permitted provided that the following conditions
|
||
are met:
|
||
|
||
* Redistributions of source code must retain the above copyright
|
||
notice, this list of conditions and the following disclaimer.
|
||
* Redistributions in binary form must reproduce the above copyright
|
||
notice, this list of conditions and the following disclaimer in
|
||
the documentation and/or other materials provided with the
|
||
distribution.
|
||
* Neither the name of Intel Corporation nor the names of its
|
||
contributors may be used to endorse or promote products derived
|
||
from this software without specific prior written permission.
|
||
|
||
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
|
||
"AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
|
||
LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
|
||
A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
|
||
OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
|
||
SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
|
||
LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
|
||
DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
|
||
THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
|
||
(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
|
||
OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
|
||
|
||
NFP poll mode driver library
|
||
============================
|
||
|
||
Netronome's sixth generation of flow processors pack 216 programmable
|
||
cores and over 100 hardware accelerators that uniquely combine packet,
|
||
flow, security and content processing in a single device that scales
|
||
up to 400 Gbps.
|
||
|
||
This document explains how to use DPDK with the Netronome Poll Mode
|
||
Driver (PMD) supporting Netronome's Network Flow Processor 6xxx
|
||
(NFP-6xxx).
|
||
|
||
Currently the driver supports virtual functions (VFs) only.
|
||
|
||
Dependencies
|
||
------------
|
||
|
||
Before using the Netronome's DPDK PMD some NFP-6xxx configuration,
|
||
which is not related to DPDK, is required. The system requires
|
||
installation of **Netronome's BSP (Board Support Package)** which includes
|
||
Linux drivers, programs and libraries.
|
||
|
||
If you have a NFP-6xxx device you should already have the code and
|
||
documentation for doing this configuration. Contact
|
||
**support@netronome.com** to obtain the latest available firmware.
|
||
|
||
The NFP Linux kernel drivers (including the required PF driver for the
|
||
NFP) are available on Github at
|
||
**https://github.com/Netronome/nfp-drv-kmods** along with build
|
||
instructions.
|
||
|
||
DPDK runs in userspace and PMDs uses the Linux kernel UIO interface to
|
||
allow access to physical devices from userspace. The NFP PMD requires
|
||
the **igb_uio** UIO driver, available with DPDK, to perform correct
|
||
initialization.
|
||
|
||
Building the software
|
||
---------------------
|
||
|
||
Netronome's PMD code is provided in the **drivers/net/nfp** directory.
|
||
Because Netronome´s BSP dependencies the driver is disabled by default
|
||
in DPDK build using **common_linuxapp configuration** file. Enabling the
|
||
driver or if you use another configuration file and want to have NFP
|
||
support, this variable is needed:
|
||
|
||
- **CONFIG_RTE_LIBRTE_NFP_PMD=y**
|
||
|
||
Once DPDK is built all the DPDK apps and examples include support for
|
||
the NFP PMD.
|
||
|
||
|
||
System configuration
|
||
--------------------
|
||
|
||
Using the NFP PMD is not different to using other PMDs. Usual steps are:
|
||
|
||
#. **Configure hugepages:** All major Linux distributions have the hugepages
|
||
functionality enabled by default. By default this allows the system uses for
|
||
working with transparent hugepages. But in this case some hugepages need to
|
||
be created/reserved for use with the DPDK through the hugetlbfs file system.
|
||
First the virtual file system need to be mounted:
|
||
|
||
.. code-block:: console
|
||
|
||
mount -t hugetlbfs none /mnt/hugetlbfs
|
||
|
||
The command uses the common mount point for this file system and it needs to
|
||
be created if necessary.
|
||
|
||
Configuring hugepages is performed via sysfs:
|
||
|
||
.. code-block:: console
|
||
|
||
/sys/kernel/mm/hugepages/hugepages-2048kB/nr_hugepages
|
||
|
||
This sysfs file is used to specify the number of hugepages to reserve.
|
||
For example:
|
||
|
||
.. code-block:: console
|
||
|
||
echo 1024 > /sys/kernel/mm/hugepages/hugepages-2048kB/nr_hugepages
|
||
|
||
This will reserve 2GB of memory using 1024 2MB hugepages. The file may be
|
||
read to see if the operation was performed correctly:
|
||
|
||
.. code-block:: console
|
||
|
||
cat /sys/kernel/mm/hugepages/hugepages-2048kB/nr_hugepages
|
||
|
||
The number of unused hugepages may also be inspected.
|
||
|
||
Before executing the DPDK app it should match the value of nr_hugepages.
|
||
|
||
.. code-block:: console
|
||
|
||
cat /sys/kernel/mm/hugepages/hugepages-2048kB/free_hugepages
|
||
|
||
The hugepages reservation should be performed at system initialization and
|
||
it is usual to use a kernel parameter for configuration. If the reservation
|
||
is attempted on a busy system it will likely fail. Reserving memory for
|
||
hugepages may be done adding the following to the grub kernel command line:
|
||
|
||
.. code-block:: console
|
||
|
||
default_hugepagesz=1M hugepagesz=2M hugepages=1024
|
||
|
||
This will reserve 2GBytes of memory using 2Mbytes huge pages.
|
||
|
||
Finally, for a NUMA system the allocation needs to be made on the correct
|
||
NUMA node. In a DPDK app there is a master core which will (usually) perform
|
||
memory allocation. It is important that some of the hugepages are reserved
|
||
on the NUMA memory node where the network device is attached. This is because
|
||
of a restriction in DPDK by which TX and RX descriptors rings must be created
|
||
on the master code.
|
||
|
||
Per-node allocation of hugepages may be inspected and controlled using sysfs.
|
||
For example:
|
||
|
||
.. code-block:: console
|
||
|
||
cat /sys/devices/system/node/node0/hugepages/hugepages-2048kB/nr_hugepages
|
||
|
||
For a NUMA system there will be a specific hugepage directory per node
|
||
allowing control of hugepage reservation. A common problem may occur when
|
||
hugepages reservation is performed after the system has been working for
|
||
some time. Configuration using the global sysfs hugepage interface will
|
||
succeed but the per-node allocations may be unsatisfactory.
|
||
|
||
The number of hugepages that need to be reserved depends on how the app uses
|
||
TX and RX descriptors, and packets mbufs.
|
||
|
||
#. **Enable SR-IOV on the NFP-6xxx device:** The current NFP PMD works with
|
||
Virtual Functions (VFs) on a NFP device. Make sure that one of the Physical
|
||
Function (PF) drivers from the above Github repository is installed and
|
||
loaded.
|
||
|
||
Virtual Functions need to be enabled before they can be used with the PMD.
|
||
Before enabling the VFs it is useful to obtain information about the
|
||
current NFP PCI device detected by the system:
|
||
|
||
.. code-block:: console
|
||
|
||
lspci -d19ee:
|
||
|
||
Now, for example, configure two virtual functions on a NFP-6xxx device
|
||
whose PCI system identity is "0000:03:00.0":
|
||
|
||
.. code-block:: console
|
||
|
||
echo 2 > /sys/bus/pci/devices/0000:03:00.0/sriov_numvfs
|
||
|
||
The result of this command may be shown using lspci again:
|
||
|
||
.. code-block:: console
|
||
|
||
lspci -d19ee: -k
|
||
|
||
Two new PCI devices should appear in the output of the above command. The
|
||
-k option shows the device driver, if any, that devices are bound to.
|
||
Depending on the modules loaded at this point the new PCI devices may be
|
||
bound to nfp_netvf driver.
|
||
|
||
#. **To install the uio kernel module (manually):** All major Linux
|
||
distributions have support for this kernel module so it is straightforward
|
||
to install it:
|
||
|
||
.. code-block:: console
|
||
|
||
modprobe uio
|
||
|
||
The module should now be listed by the lsmod command.
|
||
|
||
#. **To install the igb_uio kernel module (manually):** This module is part
|
||
of DPDK sources and configured by default (CONFIG_RTE_EAL_IGB_UIO=y).
|
||
|
||
.. code-block:: console
|
||
|
||
modprobe igb_uio.ko
|
||
|
||
The module should now be listed by the lsmod command.
|
||
|
||
Depending on which NFP modules are loaded, it could be necessary to
|
||
detach NFP devices from the nfp_netvf module. If this is the case the
|
||
device needs to be unbound, for example:
|
||
|
||
.. code-block:: console
|
||
|
||
echo 0000:03:08.0 > /sys/bus/pci/devices/0000:03:08.0/driver/unbind
|
||
|
||
lspci -d19ee: -k
|
||
|
||
The output of lspci should now show that 0000:03:08.0 is not bound to
|
||
any driver.
|
||
|
||
The next step is to add the NFP PCI ID to the IGB UIO driver:
|
||
|
||
.. code-block:: console
|
||
|
||
echo 19ee 6003 > /sys/bus/pci/drivers/igb_uio/new_id
|
||
|
||
And then to bind the device to the igb_uio driver:
|
||
|
||
.. code-block:: console
|
||
|
||
echo 0000:03:08.0 > /sys/bus/pci/drivers/igb_uio/bind
|
||
|
||
lspci -d19ee: -k
|
||
|
||
lspci should show that device bound to igb_uio driver.
|
||
|
||
#. **Using scripts to install and bind modules:** DPDK provides scripts which are
|
||
useful for installing the UIO modules and for binding the right device to those
|
||
modules avoiding doing so manually:
|
||
|
||
* **dpdk-setup.sh**
|
||
* **dpdk-devbind.py**
|
||
|
||
Configuration may be performed by running dpdk-setup.sh which invokes
|
||
dpdk-devbind.py as needed. Executing dpdk-setup.sh will display a menu of
|
||
configuration options.
|