Based on kernel version 6.11
. Page generated on 2024-09-24 08:21 EST
.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 | .. SPDX-License-Identifier: (GPL-2.0-only OR BSD-2-Clause) .. include:: <isonum.txt> =========================================== Network Flow Processor (NFP) Kernel Drivers =========================================== :Copyright: |copy| 2019, Netronome Systems, Inc. :Copyright: |copy| 2022, Corigine, Inc. Contents ======== - `Overview`_ - `Acquiring Firmware`_ - `Devlink Info`_ - `Configure Device`_ - `Statistics`_ Overview ======== This driver supports Netronome and Corigine's line of Network Flow Processor devices, including the NFP3800, NFP4000, NFP5000, and NFP6000 models, which are also incorporated in the companies' family of Agilio SmartNICs. The SR-IOV physical and virtual functions for these devices are supported by the driver. Acquiring Firmware ================== The NFP3800, NFP4000 and NFP6000 devices require application specific firmware to function. Application firmware can be located either on the host file system or in the device flash (if supported by management firmware). Firmware files on the host filesystem contain card type (`AMDA-*` string), media config etc. They should be placed in `/lib/firmware/netronome` directory to load firmware from the host file system. Firmware for basic NIC operation is available in the upstream `linux-firmware.git` repository. A more comprehensive list of firmware can be downloaded from the `Corigine Support site <https://www.corigine.com/DPUDownload.html>`_. Firmware in NVRAM ----------------- Recent versions of management firmware supports loading application firmware from flash when the host driver gets probed. The firmware loading policy configuration may be used to configure this feature appropriately. Devlink or ethtool can be used to update the application firmware on the device flash by providing the appropriate `nic_AMDA*.nffw` file to the respective command. Users need to take care to write the correct firmware image for the card and media configuration to flash. Available storage space in flash depends on the card being used. Dealing with multiple projects ------------------------------ NFP hardware is fully programmable therefore there can be different firmware images targeting different applications. When using application firmware from host, we recommend placing actual firmware files in application-named subdirectories in `/lib/firmware/netronome` and linking the desired files, e.g.:: $ tree /lib/firmware/netronome/ /lib/firmware/netronome/ ├── bpf │ ├── nic_AMDA0081-0001_1x40.nffw │ └── nic_AMDA0081-0001_4x10.nffw ├── flower │ ├── nic_AMDA0081-0001_1x40.nffw │ └── nic_AMDA0081-0001_4x10.nffw ├── nic │ ├── nic_AMDA0081-0001_1x40.nffw │ └── nic_AMDA0081-0001_4x10.nffw ├── nic_AMDA0081-0001_1x40.nffw -> bpf/nic_AMDA0081-0001_1x40.nffw └── nic_AMDA0081-0001_4x10.nffw -> bpf/nic_AMDA0081-0001_4x10.nffw 3 directories, 8 files You may need to use hard instead of symbolic links on distributions which use old `mkinitrd` command instead of `dracut` (e.g. Ubuntu). After changing firmware files you may need to regenerate the initramfs image. Initramfs contains drivers and firmware files your system may need to boot. Refer to the documentation of your distribution to find out how to update initramfs. Good indication of stale initramfs is system loading wrong driver or firmware on boot, but when driver is later reloaded manually everything works correctly. Selecting firmware per device ----------------------------- Most commonly all cards on the system use the same type of firmware. If you want to load a specific firmware image for a specific card, you can use either the PCI bus address or serial number. The driver will print which files it's looking for when it recognizes a NFP device:: nfp: Looking for firmware file in order of priority: nfp: netronome/serial-00-12-34-aa-bb-cc-10-ff.nffw: not found nfp: netronome/pci-0000:02:00.0.nffw: not found nfp: netronome/nic_AMDA0081-0001_1x40.nffw: found, loading... In this case if file (or link) called *serial-00-12-34-aa-bb-5d-10-ff.nffw* or *pci-0000:02:00.0.nffw* is present in `/lib/firmware/netronome` this firmware file will take precedence over `nic_AMDA*` files. Note that `serial-*` and `pci-*` files are **not** automatically included in initramfs, you will have to refer to documentation of appropriate tools to find out how to include them. Running firmware version ------------------------ The version of the loaded firmware for a particular <netdev> interface, (e.g. enp4s0), or an interface's port <netdev port> (e.g. enp4s0np0) can be displayed with the ethtool command:: $ ethtool -i <netdev> Firmware loading policy ----------------------- Firmware loading policy is controlled via three HWinfo parameters stored as key value pairs in the device flash: app_fw_from_flash Defines which firmware should take precedence, 'Disk' (0), 'Flash' (1) or the 'Preferred' (2) firmware. When 'Preferred' is selected, the management firmware makes the decision over which firmware will be loaded by comparing versions of the flash firmware and the host supplied firmware. This variable is configurable using the 'fw_load_policy' devlink parameter. abi_drv_reset Defines if the driver should reset the firmware when the driver is probed, either 'Disk' (0) if firmware was found on disk, 'Always' (1) reset or 'Never' (2) reset. Note that the device is always reset on driver unload if firmware was loaded when the driver was probed. This variable is configurable using the 'reset_dev_on_drv_probe' devlink parameter. abi_drv_load_ifc Defines a list of PF devices allowed to load FW on the device. This variable is not currently user configurable. Devlink Info ============ The devlink info command displays the running and stored firmware versions on the device, serial number and board information. Devlink info command example (replace PCI address):: $ devlink dev info pci/0000:03:00.0 pci/0000:03:00.0: driver nfp serial_number CSAAMDA2001-1003000111 versions: fixed: board.id AMDA2001-1003 board.rev 01 board.manufacture CSA board.model mozart running: fw.mgmt 22.10.0-rc3 fw.cpld 0x1000003 fw.app nic-22.09.0 chip.init AMDA-2001-1003 1003000111 stored: fw.bundle_id bspbundle_1003000111 fw.mgmt 22.10.0-rc3 fw.cpld 0x0 chip.init AMDA-2001-1003 1003000111 Configure Device ================ This section explains how to use Agilio SmartNICs running basic NIC firmware. Configure interface link-speed ------------------------------ The following steps explains how to change between 10G mode and 25G mode on Agilio CX 2x25GbE cards. The changing of port speed must be done in order, port 0 (p0) must be set to 10G before port 1 (p1) may be set to 10G. Down the respective interface(s):: $ ip link set dev <netdev port 0> down $ ip link set dev <netdev port 1> down Set interface link-speed to 10G:: $ ethtool -s <netdev port 0> speed 10000 $ ethtool -s <netdev port 1> speed 10000 Set interface link-speed to 25G:: $ ethtool -s <netdev port 0> speed 25000 $ ethtool -s <netdev port 1> speed 25000 Reload driver for changes to take effect:: $ rmmod nfp; modprobe nfp Configure interface Maximum Transmission Unit (MTU) --------------------------------------------------- The MTU of interfaces can temporarily be set using the iproute2, ip link or ifconfig tools. Note that this change will not persist. Setting this via Network Manager, or another appropriate OS configuration tool, is recommended as changes to the MTU using Network Manager can be made to persist. Set interface MTU to 9000 bytes:: $ ip link set dev <netdev port> mtu 9000 It is the responsibility of the user or the orchestration layer to set appropriate MTU values when handling jumbo frames or utilizing tunnels. For example, if packets sent from a VM are to be encapsulated on the card and egress a physical port, then the MTU of the VF should be set to lower than that of the physical port to account for the extra bytes added by the additional header. If a setup is expected to see fallback traffic between the SmartNIC and the kernel then the user should also ensure that the PF MTU is appropriately set to avoid unexpected drops on this path. Configure Forward Error Correction (FEC) modes ---------------------------------------------- Agilio SmartNICs support FEC mode configuration, e.g. Auto, Firecode Base-R, ReedSolomon and Off modes. Each physical port's FEC mode can be set independently using ethtool. The supported FEC modes for an interface can be viewed using:: $ ethtool <netdev> The currently configured FEC mode can be viewed using:: $ ethtool --show-fec <netdev> To force the FEC mode for a particular port, auto-negotiation must be disabled (see the `Auto-negotiation`_ section). An example of how to set the FEC mode to Reed-Solomon is:: $ ethtool --set-fec <netdev> encoding rs Auto-negotiation ---------------- To change auto-negotiation settings, the link must first be put down. After the link is down, auto-negotiation can be enabled or disabled using:: ethtool -s <netdev> autoneg <on|off> Statistics ========== Following device statistics are available through the ``ethtool -S`` interface: .. flat-table:: NFP device statistics :header-rows: 1 :widths: 3 1 11 * - Name - ID - Meaning * - dev_rx_discards - 1 - Packet can be discarded on the RX path for one of the following reasons: * The NIC is not in promisc mode, and the destination MAC address doesn't match the interfaces' MAC address. * The received packet is larger than the max buffer size on the host. I.e. it exceeds the Layer 3 MRU. * There is no freelist descriptor available on the host for the packet. It is likely that the NIC couldn't cache one in time. * A BPF program discarded the packet. * The datapath drop action was executed. * The MAC discarded the packet due to lack of ingress buffer space on the NIC. * - dev_rx_errors - 2 - A packet can be counted (and dropped) as RX error for the following reasons: * A problem with the VEB lookup (only when SR-IOV is used). * A physical layer problem that causes Ethernet errors, like FCS or alignment errors. The cause is usually faulty cables or SFPs. * - dev_rx_bytes - 3 - Total number of bytes received. * - dev_rx_uc_bytes - 4 - Unicast bytes received. * - dev_rx_mc_bytes - 5 - Multicast bytes received. * - dev_rx_bc_bytes - 6 - Broadcast bytes received. * - dev_rx_pkts - 7 - Total number of packets received. * - dev_rx_mc_pkts - 8 - Multicast packets received. * - dev_rx_bc_pkts - 9 - Broadcast packets received. * - dev_tx_discards - 10 - A packet can be discarded in the TX direction if the MAC is being flow controlled and the NIC runs out of TX queue space. * - dev_tx_errors - 11 - A packet can be counted as TX error (and dropped) for one for the following reasons: * The packet is an LSO segment, but the Layer 3 or Layer 4 offset could not be determined. Therefore LSO could not continue. * An invalid packet descriptor was received over PCIe. * The packet Layer 3 length exceeds the device MTU. * An error on the MAC/physical layer. Usually due to faulty cables or SFPs. * A CTM buffer could not be allocated. * The packet offset was incorrect and could not be fixed by the NIC. * - dev_tx_bytes - 12 - Total number of bytes transmitted. * - dev_tx_uc_bytes - 13 - Unicast bytes transmitted. * - dev_tx_mc_bytes - 14 - Multicast bytes transmitted. * - dev_tx_bc_bytes - 15 - Broadcast bytes transmitted. * - dev_tx_pkts - 16 - Total number of packets transmitted. * - dev_tx_mc_pkts - 17 - Multicast packets transmitted. * - dev_tx_bc_pkts - 18 - Broadcast packets transmitted. Note that statistics unknown to the driver will be displayed as ``dev_unknown_stat$ID``, where ``$ID`` refers to the second column above. |