From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-5.2 required=3.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS, URIBL_BLOCKED,USER_AGENT_SANE_1 autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id B1188C433E1 for ; Thu, 27 Aug 2020 18:43:28 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 9A86F2073A for ; Thu, 27 Aug 2020 18:43:28 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727017AbgH0SnV (ORCPT ); Thu, 27 Aug 2020 14:43:21 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:41532 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726120AbgH0SnV (ORCPT ); Thu, 27 Aug 2020 14:43:21 -0400 X-Greylist: delayed 754 seconds by postgrey-1.37 at lindbergh.monkeyblade.net; Thu, 27 Aug 2020 11:43:20 PDT Received: from caffeine.csclub.uwaterloo.ca (caffeine.csclub.uwaterloo.ca [IPv6:2620:101:f000:4901:c5c:0:caff:e12e]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id F290CC061264; Thu, 27 Aug 2020 11:43:20 -0700 (PDT) Received: by caffeine.csclub.uwaterloo.ca (Postfix, from userid 20367) id 2DCC6460411; Thu, 27 Aug 2020 14:30:39 -0400 (EDT) Date: Thu, 27 Aug 2020 14:30:39 -0400 To: Linux Kernel Mailing List Cc: netdev@vger.kernel.org, intel-wired-lan@lists.osuosl.org, Jeff Kirsher , Len Sorensen Subject: VRRP not working on i40e X722 S2600WFT Message-ID: <20200827183039.hrfnb63cxq3pmv4z@csclub.uwaterloo.ca> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: NeoMutt/20170113 (1.7.2) From: lsorense@csclub.uwaterloo.ca (Lennart Sorensen) Sender: netdev-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: netdev@vger.kernel.org I have hit a new problem with the X722 chipset (Intel R1304WFT server). VRRP simply does not work. When keepalived registers a vmac interface, and starts transmitting multicast packets with the vrp message, it never receives those packets from the peers, so all nodes think they are the master. tcpdump shows transmits, but no receives. If I stop keepalived, which deletes the vmac interface, then I start to receive the multicast packets from the other nodes. Even in promisc mode, tcpdump can't see those packets. So it seems the hardware is dropping all packets with a source mac that matches the source mac of the vmac interface, even when the destination is a multicast address that was subcribed to. This is clearly not proper behaviour. I tried a stock 5.8 kernel to check if a driver update helped, and updated the nvm firware to the latest 4.10 (which appears to be over a year old), and nothing changes the behaviour at all. Seems other people have hit this problem too: http://mails.dpdk.org/archives/users/2018-May/003128.html Unless someone has a way to fix this, we will have to change away from this hardware very quickly. The IPsec NAT RSS defect we could tolerate although didn't like, while this is just unworkable. Quite frustrated by this. Intel network hardware was always great, how did the X722 make it out in this state. -- Len Sorensen