linux-dev - Linux kernel development work

Age	Commit message (Collapse)	Author	Files	Lines
2022-01-06	veth: Do not record rx queue hint in veth_xmit	Daniel Borkmann	1	-1/+0
	Laurent reported that they have seen a significant amount of TCP retransmissions at high throughput from applications residing in network namespaces talking to the outside world via veths. The drops were seen on the qdisc layer (fq_codel, as per systemd default) of the phys device such as ena or virtio_net due to all traffic hitting a _single_ TX queue _despite_ multi-queue device. (Note that the setup was _not_ using XDP on veths as the issue is generic.) More specifically, after edbea9220251 ("veth: Store queue_mapping independently of XDP prog presence") which made it all the way back to v4.19.184+, skb_record_rx_queue() would set skb->queue_mapping to 1 (given 1 RX and 1 TX queue by default for veths) instead of leaving at 0. This is eventually retained and callbacks like ena_select_queue() will also pick single queue via netdev_core_pick_tx()'s ndo_select_queue() once all the traffic is forwarded to that device via upper stack or other means. Similarly, for others not implementing ndo_select_queue() if XPS is disabled, netdev_pick_tx() might call into the skb_tx_hash() and check for prior skb_rx_queue_recorded() as well. In general, it is a _bad_ idea for virtual devices like veth to mess around with queue selection [by default]. Given dev->real_num_tx_queues is by default 1, the skb->queue_mapping was left untouched, and so prior to edbea9220251 the netdev_core_pick_tx() could do its job upon __dev_queue_xmit() on the phys device. Unbreak this and restore prior behavior by removing the skb_record_rx_queue() from veth_xmit() altogether. If the veth peer has an XDP program attached, then it would return the first RX queue index in xdp_md->rx_queue_index (unless configured in non-default manner). However, this is still better than breaking the generic case. Fixes: edbea9220251 ("veth: Store queue_mapping independently of XDP prog presence") Fixes: 638264dc9022 ("veth: Support per queue XDP ring") Reported-by: Laurent Bernaille <laurent.bernaille@datadoghq.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Cc: Maciej Fijalkowski <maciej.fijalkowski@intel.com> Cc: Toshiaki Makita <toshiaki.makita1@gmail.com> Cc: Eric Dumazet <eric.dumazet@gmail.com> Cc: Paolo Abeni <pabeni@redhat.com> Cc: John Fastabend <john.fastabend@gmail.com> Cc: Willem de Bruijn <willemb@google.com> Acked-by: John Fastabend <john.fastabend@gmail.com> Reviewed-by: Eric Dumazet <edumazet@google.com> Acked-by: Toshiaki Makita <toshiaki.makita1@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	ethernet: ibmveth: use default_groups in kobj_type	Greg Kroah-Hartman	1	-1/+2
	There are currently 2 ways to create a set of sysfs files for a kobj_type, through the default_attrs field, and the default_groups field. Move the ibmveth sysfs code to use default_groups field which has been the preferred way since aa30f47cf666 ("kobject: Add support for default attribute groups to kobj_type") so that we can soon get rid of the obsolete default_attrs field. Cc: Michael Ellerman <mpe@ellerman.id.au> Cc: Benjamin Herrenschmidt <benh@kernel.crashing.org> Cc: Paul Mackerras <paulus@samba.org> Cc: Cristobal Forno <cforno12@linux.ibm.com> Cc: "David S. Miller" <davem@davemloft.net> Cc: Jakub Kicinski <kuba@kernel.org> Cc: linuxppc-dev@lists.ozlabs.org Cc: netdev@vger.kernel.org Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Reviewed-by: Tyrel Datwyler <tyreld@linux.ibm.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	sfc: Use swap() instead of open coding it	Jiapeng Chong	1	-10/+4
	Clean the following coccicheck warning: ./drivers/net/ethernet/sfc/efx_channels.c:870:36-37: WARNING opportunity for swap(). ./drivers/net/ethernet/sfc/efx_channels.c:824:36-37: WARNING opportunity for swap(). Reported-by: Abaci Robot <abaci@linux.alibaba.com> Signed-off-by: Jiapeng Chong <jiapeng.chong@linux.alibaba.com> Acked-by: Martin Habets <habetsm.xilinx@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	ethtool: use phydev variable	Tom Rix	1	-4/+4
	In ethtool_get_phy_stats(), the phydev varaible is set to dev->phydev but dev->phydev is still used. Replace dev->phydev uses with phydev. Signed-off-by: Tom Rix <trix@redhat.com> Reviewed-by: Andrew Lunn <andrew@lunn.ch> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	net: macb: use .mac_select_pcs() interface	Russell King (Oracle)	2	-15/+14
	Convert the PCS selection to use mac_select_pcs, which allows the PCS to perform any validation it needs. We must use separate phylink_pcs instances for the USX and SGMII PCS, rather than just changing the "ops" pointer before re-setting it to phylink as this interface queries the PCS, rather than requesting it to be changed. Acked-by: Nicolas Ferre <nicolas.ferre@microchip.com> Signed-off-by: Russell King (Oracle) <rmk+kernel@armlinux.org.uk> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	gro: add ability to control gro max packet size	Coco Li	6	-1/+41
	Eric Dumazet suggested to allow users to modify max GRO packet size. We have seen GRO being disabled by users of appliances (such as wifi access points) because of claimed bufferbloat issues, or some work arounds in sch_cake, to split GRO/GSO packets. Instead of disabling GRO completely, one can chose to limit the maximum packet size of GRO packets, depending on their latency constraints. This patch adds a per device gro_max_size attribute that can be changed with ip link command. ip link set dev eth0 gro_max_size 16000 Suggested-by: Eric Dumazet <edumazet@google.com> Signed-off-by: Coco Li <lixiaoyan@google.com> Signed-off-by: Eric Dumazet <edumazet@google.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	net: fix SOF_TIMESTAMPING_BIND_PHC to work with multiple sockets	Miroslav Lichvar	3	-13/+18
	When multiple sockets using the SOF_TIMESTAMPING_BIND_PHC flag received a packet with a hardware timestamp (e.g. multiple PTP instances in different PTP domains using the UDPv4/v6 multicast or L2 transport), the timestamps received on some sockets were corrupted due to repeated conversion of the same timestamp (by the same or different vclocks). Fix ptp_convert_timestamp() to not modify the shared skb timestamp and return the converted timestamp as a ktime_t instead. If the conversion fails, return 0 to not confuse the application with timestamps corresponding to an unexpected PHC. Fixes: d7c088265588 ("net: socket: support hardware timestamp conversion to PHC bound") Signed-off-by: Miroslav Lichvar <mlichvar@redhat.com> Cc: Yangbo Lu <yangbo.lu@nxp.com> Cc: Richard Cochran <richardcochran@gmail.com> Acked-by: Richard Cochran <richardcochran@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	net: dsa: warn about dsa_port and dsa_switch bit fields being non atomic	Vladimir Oltean	1	-0/+8
	As discussed during review here: https://patchwork.kernel.org/project/netdevbpf/patch/20220105132141.2648876-3-vladimir.oltean@nxp.com/ we should inform developers about pitfalls of concurrent access to the boolean properties of dsa_switch and dsa_port, now that they've been converted to bit fields. No other measure than a comment needs to be taken, since the code paths that update these bit fields are not concurrent with each other. Suggested-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	net: dsa: don't enumerate dsa_switch and dsa_port bit fields using commas	Vladimir Oltean	1	-58/+56
	This is a cosmetic incremental fixup to commits 7787ff776398 ("net: dsa: merge all bools of struct dsa_switch into a single u32") bde82f389af1 ("net: dsa: merge all bools of struct dsa_port into a single u8") The desire to make this change was enunciated after posting these patches here: https://patchwork.kernel.org/project/netdevbpf/cover/20220105132141.2648876-1-vladimir.oltean@nxp.com/ but due to a slight timing overlap (message posted at 2:28 p.m. UTC, merge commit is at 2:46 p.m. UTC), that comment was missed and the changes were applied as-is. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	net: dsa: setup master before ports	Vladimir Oltean	1	-10/+13
	It is said that as soon as a network interface is registered, all its resources should have already been prepared, so that it is available for sending and receiving traffic. One of the resources needed by a DSA slave interface is the master. dsa_tree_setup -> dsa_tree_setup_ports -> dsa_port_setup -> dsa_slave_create -> register_netdevice -> dsa_tree_setup_master -> dsa_master_setup -> sets up master->dsa_ptr, which enables reception Therefore, there is a short period of time after register_netdevice() during which the master isn't prepared to pass traffic to the DSA layer (master->dsa_ptr is checked by eth_type_trans). Same thing during unregistration, there is a time frame in which packets might be missed. Note that this change opens us to another race: dsa_master_find_slave() will get invoked potentially earlier than the slave creation, and later than the slave deletion. Since dp->slave starts off as a NULL pointer, the earlier calls aren't a problem, but the later calls are. To avoid use-after-free, we should zeroize dp->slave before calling dsa_slave_destroy(). In practice I cannot really test real life improvements brought by this change, since in my systems, netdevice creation races with PHY autoneg which takes a few seconds to complete, and that masks quite a few races. Effects might be noticeable in a setup with fixed links all the way to an external system. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	net: dsa: first set up shared ports, then non-shared ports	Vladimir Oltean	1	-13/+37
	After commit a57d8c217aad ("net: dsa: flush switchdev workqueue before tearing down CPU/DSA ports"), the port setup and teardown procedure became asymmetric. The fact of the matter is that user ports need the shared ports to be up before they can be used for CPU-initiated termination. And since we register net devices for the user ports, those won't be functional until we also call the setup for the shared (CPU, DSA) ports. But we may do that later, depending on the port numbering scheme of the hardware we are dealing with. It just makes sense that all shared ports are brought up before any user port is. I can't pinpoint any issue due to the current behavior, but let's change it nonetheless, for consistency's sake. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	net: dsa: hold rtnl_mutex when calling dsa_master_{setup,teardown}	Vladimir Oltean	2	-2/+10
	DSA needs to simulate master tracking events when a binding is first with a DSA master established and torn down, in order to give drivers the simplifying guarantee that ->master_state_change calls are made only when the master's readiness state to pass traffic changes. master_state_change() provide a operational bool that DSA driver can use to understand if DSA master is operational or not. To avoid races, we need to block the reception of NETDEV_UP/NETDEV_CHANGE/NETDEV_GOING_DOWN events in the netdev notifier chain while we are changing the master's dev->dsa_ptr (this changes what netdev_uses_dsa(dev) reports). The dsa_master_setup() and dsa_master_teardown() functions optionally require the rtnl_mutex to be held, if the tagger needs the master to be promiscuous, these functions call dev_set_promiscuity(). Move the rtnl_lock() from that function and make it top-level. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Reviewed-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	net: dsa: stop updating master MTU from master.c	Vladimir Oltean	1	-24/+1
	At present there are two paths for changing the MTU of the DSA master. The first is: dsa_tree_setup -> dsa_tree_setup_ports -> dsa_port_setup -> dsa_slave_create -> dsa_slave_change_mtu -> dev_set_mtu(master) The second is: dsa_tree_setup -> dsa_tree_setup_master -> dsa_master_setup -> dev_set_mtu(dev) So the dev_set_mtu() call from dsa_master_setup() has been effectively superseded by the dsa_slave_change_mtu(slave_dev, ETH_DATA_LEN) that is done from dsa_slave_create() for each user port. The later function also updates the master MTU according to the largest user port MTU from the tree. Therefore, updating the master MTU through a separate code path isn't needed. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Reviewed-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	net: dsa: merge rtnl_lock sections in dsa_slave_create	Vladimir Oltean	1	-3/+1
	Currently dsa_slave_create() has two sequences of rtnl_lock/rtnl_unlock in a row. Remove the rtnl_unlock() and rtnl_lock() in between, such that the operation can execute slighly faster. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Reviewed-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-06	net: dsa: reorder PHY initialization with MTU setup in slave.c	Vladimir Oltean	1	-7/+7
	In dsa_slave_create() there are 2 sections that take rtnl_lock(): MTU change and netdev registration. They are separated by PHY initialization. There isn't any strict ordering requirement except for the fact that netdev registration should be last. Therefore, we can perform the MTU change a bit later, after the PHY setup. A future change will then be able to merge the two rtnl_lock sections into one. Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Reviewed-by: Florian Fainelli <f.fainelli@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: lantiq_xrx200: convert to build_skb	Aleksander Jan Bajkowski	1	-20/+36
	We can increase the efficiency of rx path by using buffers to receive packets then build SKBs around them just before passing into the network stack. In contrast, preallocating SKBs too early reduces CPU cache efficiency. NAT Performance results on BT Home Hub 5A (kernel 5.10.89, mtu 1500): Down Up Before 577 Mbps 648 Mbps After 624 Mbps 695 Mbps Signed-off-by: Aleksander Jan Bajkowski <olek2@wp.pl> Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	net: lantiq_xrx200: increase napi poll weigth	Aleksander Jan Bajkowski	1	-2/+4
	NAT Performance results on BT Home Hub 5A (kernel 5.10.89, mtu 1500): Down Up Before 545 Mbps 625 Mbps After 577 Mbps 648 Mbps Signed-off-by: Aleksander Jan Bajkowski <olek2@wp.pl> Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	MIPS: lantiq: dma: increase descritor count	Aleksander Jan Bajkowski	1	-1/+1
	NAT Performance results on BT Home Hub 5A (kernel 5.10.89, mtu 1500): Down Up Before 539 Mbps 599 Mbps After 545 Mbps 625 Mbps Signed-off-by: Aleksander Jan Bajkowski <olek2@wp.pl> Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	testptp: set pin function before other requests	Miroslav Lichvar	1	-12/+12
	When the -L option of the testptp utility is specified with other options (e.g. -p to enable PPS output), the user probably wants to apply it to the pin configured by the -L option. Reorder the code to set the pin function before other function requests to avoid confusing users. Signed-off-by: Miroslav Lichvar <mlichvar@redhat.com> Reviewed-by: Vladimir Oltean <olteanv@gmail.com> Acked-by: Richard Cochran <richardcochran@gmail.com> Link: https://lore.kernel.org/r/20220105152506.3256026-1-mlichvar@redhat.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	net: gemini: allow any RGMII interface mode	Russell King (Oracle)	1	-3/+6
	The four RGMII interface modes take care of the required RGMII delay configuration at the PHY and should not be limited by the network MAC driver. Sadly, gemini was only permitting RGMII mode with no delays, which would require the required delay to be inserted via PCB tracking or by the MAC. However, there are designs that require the PHY to add the delay, which is impossible without Gemini permitting the other three PHY interface modes. Fix the driver to allow these. Signed-off-by: Russell King (Oracle) <rmk+kernel@armlinux.org.uk> Reviewed-by: Linus Walleij <linus.walleij@linaro.org> Tested-by: Corentin Labbe <clabbe.montjoie@gmail.com> Link: https://lore.kernel.org/r/E1n4mpT-002PLd-Ha@rmk-PC.armlinux.org.uk Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	net: phy: marvell: configure RGMII delays for 88E1118	Russell King (Oracle)	1	-0/+6
	Corentin Labbe reports that the SSI 1328 does not work when allowing the PHY to operate at gigabit speeds, but does work with the generic PHY driver. This appears to be because m88e1118_config_init() writes a fixed value to the MSCR register, claiming that this is to enable 1G speeds. However, this always sets bits 4 and 5, enabling RGMII transmit and receive delays. The suspicion is that the original board this was added for required the delays to make 1G speeds work. Add the necessary configuration for RGMII delays for the 88E1118 to bring this into line with the requirements for RGMII support, and thus make the SSI 1328 work. Corentin Labbe has tested this on gemini-ssi1328 and gemini-ns2502. Reported-by: Corentin Labbe <clabbe.montjoie@gmail.com> Tested-by: Corentin Labbe <clabbe.montjoie@gmail.com> Signed-off-by: Russell King (Oracle) <rmk+kernel@armlinux.org.uk> Reviewed-by: Andrew Lunn <andrew@lunn.ch> Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	net: phy: marvell: use phy_write_paged() to set MSCR	Russell King (Oracle)	1	-14/+8
	Use phy_write_paged() in m88e1118_config_init() to set the MSCR value. We leave the other paged write for the LEDs in case the DT register parsing is relying on this page. Signed-off-by: Russell King (Oracle) <rmk+kernel@armlinux.org.uk> Reviewed-by: Andrew Lunn <andrew@lunn.ch> Tested-by: Corentin Labbe <clabbe.montjoie@gmail.com> Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	selftests: set amt.sh executable	Taehee Yoo	1	-0/+0
	amt.sh test script will not work because it doesn't have execution permission. So, it adds execution permission. Reported-by: Hangbin Liu <liuhangbin@gmail.com> Fixes: c08e8baea78e ("selftests: add amt interface selftest script") Signed-off-by: Taehee Yoo <ap420073@gmail.com> Link: https://lore.kernel.org/r/20220105144436.13415-1-ap420073@gmail.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	Revert "net: usb: r8152: Add MAC passthrough support for more Lenovo Docks"	Aaron Ma	1	-3/+6
	This reverts commit f77b83b5bbab53d2be339184838b19ed2c62c0a5. This change breaks multiple usb to ethernet dongles attached on Lenovo USB hub. Fixes: f77b83b5bbab ("net: usb: r8152: Add MAC passthrough support for more Lenovo Docks") Signed-off-by: Aaron Ma <aaron.ma@canonical.com> Link: https://lore.kernel.org/r/20220105155102.8557-1-aaron.ma@canonical.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	netlink: do not allocate a device refcount tracker in ethnl_default_notify()	Eric Dumazet	1	-1/+0
	As reported by Johannes, the tracker allocated in ethnl_default_notify() is not really needed, as this function is not expected to change a device reference count. Fixes: e4b8954074f6 ("netlink: add net device refcount tracker to struct ethnl_req_info") Signed-off-by: Eric Dumazet <edumazet@google.com> Reported-by: Johannes Berg <johannes@sipsolutions.net> Tested-by: Johannes Berg <johannes@sipsolutions.net> Link: https://lore.kernel.org/r/20220105170849.2610470-1-eric.dumazet@gmail.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	net/sched: add missing tracker information in qdisc_create()	Eric Dumazet	1	-1/+1
	qdisc_create() error path needs to use dev_put_track() because qdisc_alloc() allocated the tracker. Fixes: 606509f27f67 ("net/sched: add net device refcount tracker to struct Qdisc") Signed-off-by: Eric Dumazet <edumazet@google.com> Link: https://lore.kernel.org/r/20220104170439.3790052-1-eric.dumazet@gmail.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>
2022-01-05	net: dsa: remove cross-chip support for HSR	Vladimir Oltean	3	-49/+13
	The cross-chip notifiers for HSR are bypass operations, meaning that even though all switches in a tree are notified, only the switch specified in the info structure is targeted. We can eliminate the unnecessary complexity by deleting the cross-chip notifier logic and calling the ds->ops straight from port.c. Cc: George McCollister <george.mccollister@gmail.com> Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Reviewed-by: George McCollister <george.mccollister@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: dsa: remove cross-chip support for MRP	Vladimir Oltean	3	-106/+20
	The cross-chip notifiers for MRP are bypass operations, meaning that even though all switches in a tree are notified, only the switch specified in the info structure is targeted. We can eliminate the unnecessary complexity by deleting the cross-chip notifier logic and calling the ds->ops straight from port.c. Cc: Horatiu Vultur <horatiu.vultur@microchip.com> Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: dsa: fix incorrect function pointer check for MRP ring roles	Vladimir Oltean	1	-2/+2
	The cross-chip notifier boilerplate code meant to check the presence of ds->ops->port_mrp_add_ring_role before calling it, but checked ds->ops->port_mrp_add instead, before calling ds->ops->port_mrp_add_ring_role. Therefore, a driver which implements one operation but not the other would trigger a NULL pointer dereference. There isn't any such driver in DSA yet, so there is no reason to backport the change. Issue found through code inspection. Cc: Horatiu Vultur <horatiu.vultur@microchip.com> Fixes: c595c4330da0 ("net: dsa: add MRP support") Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	mlxsw: pci: Avoid flow control for EMAD packets	Danielle Ratson	2	-1/+17
	Locally generated packets ingress the device through its CPU port. When the CPU port is congested and there are not enough credits in its headroom buffer, packets can be dropped. While this might be acceptable for data packets that traverse the network, configuration packets exchanged between the host and the device (EMADs) should not be subjected to this flow control. The "sdq_lp" bit in the SDQ (Send Descriptor Queue) context allows the host to instruct the device to treat packets sent on this queue as "local processing" and always process them, regardless of the state of the CPU port's headroom. Add the definition of this bit and set it for the dedicated SDQ reserved for the transmission of EMAD packets. This makes the "local processing" bit in the WQE (Work Queue Element) redundant, so clear it. Signed-off-by: Danielle Ratson <danieller@nvidia.com> Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: dsa: combine two holes in struct dsa_switch_tree	Vladimir Oltean	1	-3/+3
	There is a 7 byte hole after dst->setup and a 4 byte hole after dst->default_proto. Combining them, we have a single hole of just 3 bytes on 64 bit machines. Before: pahole -C dsa_switch_tree net/dsa/slave.o struct dsa_switch_tree { struct list_head list; /* 0 16 / struct list_head ports; / 16 16 / struct raw_notifier_head nh; / 32 8 / unsigned int index; / 40 4 / struct kref refcount; / 44 4 / struct net_device * lags; /* 48 8 / bool setup; / 56 1 / / XXX 7 bytes hole, try to pack / / --- cacheline 1 boundary (64 bytes) --- / const struct dsa_device_ops tag_ops; /* 64 8 / enum dsa_tag_protocol default_proto; / 72 4 / / XXX 4 bytes hole, try to pack / struct dsa_platform_data pd; /* 80 8 / struct list_head rtable; / 88 16 / unsigned int lags_len; / 104 4 / unsigned int last_switch; / 108 4 / / size: 112, cachelines: 2, members: 13 / / sum members: 101, holes: 2, sum holes: 11 / / last cacheline: 48 bytes / }; After: pahole -C dsa_switch_tree net/dsa/slave.o struct dsa_switch_tree { struct list_head list; / 0 16 / struct list_head ports; / 16 16 / struct raw_notifier_head nh; / 32 8 / unsigned int index; / 40 4 / struct kref refcount; / 44 4 / struct net_device * lags; /* 48 8 / const struct dsa_device_ops tag_ops; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / enum dsa_tag_protocol default_proto; / 64 4 / bool setup; / 68 1 / / XXX 3 bytes hole, try to pack / struct dsa_platform_data pd; /* 72 8 / struct list_head rtable; / 80 16 / unsigned int lags_len; / 96 4 / unsigned int last_switch; / 100 4 / / size: 104, cachelines: 2, members: 13 / / sum members: 101, holes: 1, sum holes: 3 / / last cacheline: 40 bytes */ }; Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: dsa: move dsa_switch_tree :: ports and lags to first cache line	Vladimir Oltean	1	-7/+9
	dst->ports is accessed most notably by dsa_master_find_slave(), which is invoked in the RX path. dst->lags is accessed by dsa_lag_dev(), which is invoked in the RX path of tag_dsa.c. dst->tag_ops, dst->default_proto and dst->pd don't need to be in the first cache line, so they are moved out by this change. Before: pahole -C dsa_switch_tree net/dsa/slave.o struct dsa_switch_tree { struct list_head list; /* 0 16 / struct raw_notifier_head nh; / 16 8 / unsigned int index; / 24 4 / struct kref refcount; / 28 4 / bool setup; / 32 1 / / XXX 7 bytes hole, try to pack / const struct dsa_device_ops tag_ops; /* 40 8 / enum dsa_tag_protocol default_proto; / 48 4 / / XXX 4 bytes hole, try to pack / struct dsa_platform_data pd; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / struct list_head ports; / 64 16 / struct list_head rtable; / 80 16 / struct net_device * lags; /* 96 8 / unsigned int lags_len; / 104 4 / unsigned int last_switch; / 108 4 / / size: 112, cachelines: 2, members: 13 / / sum members: 101, holes: 2, sum holes: 11 / / last cacheline: 48 bytes / }; After: pahole -C dsa_switch_tree net/dsa/slave.o struct dsa_switch_tree { struct list_head list; / 0 16 / struct list_head ports; / 16 16 / struct raw_notifier_head nh; / 32 8 / unsigned int index; / 40 4 / struct kref refcount; / 44 4 / struct net_device * lags; /* 48 8 / bool setup; / 56 1 / / XXX 7 bytes hole, try to pack / / --- cacheline 1 boundary (64 bytes) --- / const struct dsa_device_ops tag_ops; /* 64 8 / enum dsa_tag_protocol default_proto; / 72 4 / / XXX 4 bytes hole, try to pack / struct dsa_platform_data pd; /* 80 8 / struct list_head rtable; / 88 16 / unsigned int lags_len; / 104 4 / unsigned int last_switch; / 108 4 / / size: 112, cachelines: 2, members: 13 / / sum members: 101, holes: 2, sum holes: 11 / / last cacheline: 48 bytes */ }; Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: dsa: make dsa_switch :: num_ports an unsigned int	Vladimir Oltean	2	-2/+2
	Currently, num_ports is declared as size_t, which is defined as __kernel_ulong_t, therefore it occupies 8 bytes of memory. Even switches with port numbers in the range of tens are exotic, so there is no need for this amount of storage. Additionally, because the max_num_bridges member right above it is also 4 bytes, it means the compiler needs to add padding between the last 2 fields. By reducing the size, we don't need that padding and can reduce the struct size. Before: pahole -C dsa_switch net/dsa/slave.o struct dsa_switch { struct device * dev; /* 0 8 / struct dsa_switch_tree dst; /* 8 8 / unsigned int index; / 16 4 / u32 setup:1; / 20: 0 4 / u32 vlan_filtering_is_global:1; / 20: 1 4 / u32 needs_standalone_vlan_filtering:1; / 20: 2 4 / u32 configure_vlan_while_not_filtering:1; / 20: 3 4 / u32 untag_bridge_pvid:1; / 20: 4 4 / u32 assisted_learning_on_cpu_port:1; / 20: 5 4 / u32 vlan_filtering:1; / 20: 6 4 / u32 pcs_poll:1; / 20: 7 4 / u32 mtu_enforcement_ingress:1; / 20: 8 4 / / XXX 23 bits hole, try to pack / struct notifier_block nb; / 24 24 / / XXX last struct has 4 bytes of padding / void priv; /* 48 8 / void tagger_data; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / struct dsa_chip_data cd; /* 64 8 / const struct dsa_switch_ops ops; /* 72 8 / u32 phys_mii_mask; / 80 4 / / XXX 4 bytes hole, try to pack / struct mii_bus slave_mii_bus; /* 88 8 / unsigned int ageing_time_min; / 96 4 / unsigned int ageing_time_max; / 100 4 / struct dsa_8021q_context tag_8021q_ctx; /* 104 8 / struct devlink devlink; /* 112 8 / unsigned int num_tx_queues; / 120 4 / unsigned int num_lag_ids; / 124 4 / / --- cacheline 2 boundary (128 bytes) --- / unsigned int max_num_bridges; / 128 4 / / XXX 4 bytes hole, try to pack / size_t num_ports; / 136 8 / / size: 144, cachelines: 3, members: 27 / / sum members: 132, holes: 2, sum holes: 8 / / sum bitfield members: 9 bits, bit holes: 1, sum bit holes: 23 bits / / paddings: 1, sum paddings: 4 / / last cacheline: 16 bytes / }; After: pahole -C dsa_switch net/dsa/slave.o struct dsa_switch { struct device dev; /* 0 8 / struct dsa_switch_tree dst; /* 8 8 / unsigned int index; / 16 4 / u32 setup:1; / 20: 0 4 / u32 vlan_filtering_is_global:1; / 20: 1 4 / u32 needs_standalone_vlan_filtering:1; / 20: 2 4 / u32 configure_vlan_while_not_filtering:1; / 20: 3 4 / u32 untag_bridge_pvid:1; / 20: 4 4 / u32 assisted_learning_on_cpu_port:1; / 20: 5 4 / u32 vlan_filtering:1; / 20: 6 4 / u32 pcs_poll:1; / 20: 7 4 / u32 mtu_enforcement_ingress:1; / 20: 8 4 / / XXX 23 bits hole, try to pack / struct notifier_block nb; / 24 24 / / XXX last struct has 4 bytes of padding / void priv; /* 48 8 / void tagger_data; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / struct dsa_chip_data cd; /* 64 8 / const struct dsa_switch_ops ops; /* 72 8 / u32 phys_mii_mask; / 80 4 / / XXX 4 bytes hole, try to pack / struct mii_bus slave_mii_bus; /* 88 8 / unsigned int ageing_time_min; / 96 4 / unsigned int ageing_time_max; / 100 4 / struct dsa_8021q_context tag_8021q_ctx; /* 104 8 / struct devlink devlink; /* 112 8 / unsigned int num_tx_queues; / 120 4 / unsigned int num_lag_ids; / 124 4 / / --- cacheline 2 boundary (128 bytes) --- / unsigned int max_num_bridges; / 128 4 / unsigned int num_ports; / 132 4 / / size: 136, cachelines: 3, members: 27 / / sum members: 128, holes: 1, sum holes: 4 / / sum bitfield members: 9 bits, bit holes: 1, sum bit holes: 23 bits / / paddings: 1, sum paddings: 4 / / last cacheline: 8 bytes */ }; Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: dsa: merge all bools of struct dsa_switch into a single u32	Vladimir Oltean	1	-46/+51
	struct dsa_switch has 9 boolean properties, many of which are in fact set by drivers for custom behavior (vlan_filtering_is_global, needs_standalone_vlan_filtering, etc etc). The binary layout of the structure could be improved. For example, the "bool setup" at the beginning introduces a gratuitous 7 byte hole in the first cache line. The change merges all boolean properties into bitfields of an u32, and places that u32 in the first cache line of the structure, since many bools are accessed from the data path (untag_bridge_pvid, vlan_filtering, vlan_filtering_is_global). We place this u32 after the existing ds->index, which is also 4 bytes in size. As a positive side effect, ds->tagger_data now fits into the first cache line too, because 4 bytes are saved. Before: pahole -C dsa_switch net/dsa/slave.o struct dsa_switch { bool setup; /* 0 1 / / XXX 7 bytes hole, try to pack / struct device dev; /* 8 8 / struct dsa_switch_tree dst; /* 16 8 / unsigned int index; / 24 4 / / XXX 4 bytes hole, try to pack / struct notifier_block nb; / 32 24 / / XXX last struct has 4 bytes of padding / void priv; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / void tagger_data; /* 64 8 / struct dsa_chip_data cd; /* 72 8 / const struct dsa_switch_ops ops; /* 80 8 / u32 phys_mii_mask; / 88 4 / / XXX 4 bytes hole, try to pack / struct mii_bus slave_mii_bus; /* 96 8 / unsigned int ageing_time_min; / 104 4 / unsigned int ageing_time_max; / 108 4 / struct dsa_8021q_context tag_8021q_ctx; /* 112 8 / struct devlink devlink; /* 120 8 / / --- cacheline 2 boundary (128 bytes) --- / unsigned int num_tx_queues; / 128 4 / bool vlan_filtering_is_global; / 132 1 / bool needs_standalone_vlan_filtering; / 133 1 / bool configure_vlan_while_not_filtering; / 134 1 / bool untag_bridge_pvid; / 135 1 / bool assisted_learning_on_cpu_port; / 136 1 / bool vlan_filtering; / 137 1 / bool pcs_poll; / 138 1 / bool mtu_enforcement_ingress; / 139 1 / unsigned int num_lag_ids; / 140 4 / unsigned int max_num_bridges; / 144 4 / / XXX 4 bytes hole, try to pack / size_t num_ports; / 152 8 / / size: 160, cachelines: 3, members: 27 / / sum members: 141, holes: 4, sum holes: 19 / / paddings: 1, sum paddings: 4 / / last cacheline: 32 bytes / }; After: pahole -C dsa_switch net/dsa/slave.o struct dsa_switch { struct device dev; /* 0 8 / struct dsa_switch_tree dst; /* 8 8 / unsigned int index; / 16 4 / u32 setup:1; / 20: 0 4 / u32 vlan_filtering_is_global:1; / 20: 1 4 / u32 needs_standalone_vlan_filtering:1; / 20: 2 4 / u32 configure_vlan_while_not_filtering:1; / 20: 3 4 / u32 untag_bridge_pvid:1; / 20: 4 4 / u32 assisted_learning_on_cpu_port:1; / 20: 5 4 / u32 vlan_filtering:1; / 20: 6 4 / u32 pcs_poll:1; / 20: 7 4 / u32 mtu_enforcement_ingress:1; / 20: 8 4 / / XXX 23 bits hole, try to pack / struct notifier_block nb; / 24 24 / / XXX last struct has 4 bytes of padding / void priv; /* 48 8 / void tagger_data; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / struct dsa_chip_data cd; /* 64 8 / const struct dsa_switch_ops ops; /* 72 8 / u32 phys_mii_mask; / 80 4 / / XXX 4 bytes hole, try to pack / struct mii_bus slave_mii_bus; /* 88 8 / unsigned int ageing_time_min; / 96 4 / unsigned int ageing_time_max; / 100 4 / struct dsa_8021q_context tag_8021q_ctx; /* 104 8 / struct devlink devlink; /* 112 8 / unsigned int num_tx_queues; / 120 4 / unsigned int num_lag_ids; / 124 4 / / --- cacheline 2 boundary (128 bytes) --- / unsigned int max_num_bridges; / 128 4 / / XXX 4 bytes hole, try to pack / size_t num_ports; / 136 8 / / size: 144, cachelines: 3, members: 27 / / sum members: 132, holes: 2, sum holes: 8 / / sum bitfield members: 9 bits, bit holes: 1, sum bit holes: 23 bits / / paddings: 1, sum paddings: 4 / / last cacheline: 16 bytes */ }; Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: dsa: move dsa_port :: type near dsa_port :: index	Vladimir Oltean	1	-2/+4
	Both dsa_port :: type and dsa_port :: index introduce a 4 octet hole after them, so we can group them together and the holes would be eliminated, turning 16 octets of storage into just 8. This makes the cpu_dp pointer fit in the first cache line, which is good, because dsa_slave_to_master(), called by dsa_enqueue_skb(), uses it. Before: pahole -C dsa_port net/dsa/slave.o struct dsa_port { union { struct net_device * master; /* 0 8 / struct net_device slave; /* 0 8 / }; / 0 8 / const struct dsa_device_ops tag_ops; /* 8 8 / struct dsa_switch_tree dst; /* 16 8 / struct sk_buff (rcv)(struct sk_buff , struct net_device ); / 24 8 / enum { DSA_PORT_TYPE_UNUSED = 0, DSA_PORT_TYPE_CPU = 1, DSA_PORT_TYPE_DSA = 2, DSA_PORT_TYPE_USER = 3, } type; / 32 4 / / XXX 4 bytes hole, try to pack / struct dsa_switch ds; /* 40 8 / unsigned int index; / 48 4 / / XXX 4 bytes hole, try to pack / const char name; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / struct dsa_port cpu_dp; /* 64 8 / u8 mac[6]; / 72 6 / u8 stp_state; / 78 1 / u8 vlan_filtering:1; / 79: 0 1 / u8 learning:1; / 79: 1 1 / u8 lag_tx_enabled:1; / 79: 2 1 / u8 devlink_port_setup:1; / 79: 3 1 / u8 setup:1; / 79: 4 1 / / XXX 3 bits hole, try to pack / struct device_node dn; /* 80 8 / unsigned int ageing_time; / 88 4 / / XXX 4 bytes hole, try to pack / struct dsa_bridge bridge; /* 96 8 / struct devlink_port devlink_port; / 104 288 / / --- cacheline 6 boundary (384 bytes) was 8 bytes ago --- / struct phylink pl; /* 392 8 / struct phylink_config pl_config; / 400 40 / struct net_device lag_dev; /* 440 8 / / --- cacheline 7 boundary (448 bytes) --- / struct net_device hsr_dev; /* 448 8 / struct list_head list; / 456 16 / const struct ethtool_ops orig_ethtool_ops; /* 472 8 / const struct dsa_netdevice_ops netdev_ops; /* 480 8 / struct mutex addr_lists_lock; / 488 32 / / --- cacheline 8 boundary (512 bytes) was 8 bytes ago --- / struct list_head fdbs; / 520 16 / struct list_head mdbs; / 536 16 / / size: 552, cachelines: 9, members: 30 / / sum members: 539, holes: 3, sum holes: 12 / / sum bitfield members: 5 bits, bit holes: 1, sum bit holes: 3 bits / / last cacheline: 40 bytes / }; After: pahole -C dsa_port net/dsa/slave.o struct dsa_port { union { struct net_device master; /* 0 8 / struct net_device slave; /* 0 8 / }; / 0 8 / const struct dsa_device_ops tag_ops; /* 8 8 / struct dsa_switch_tree dst; /* 16 8 / struct sk_buff (rcv)(struct sk_buff , struct net_device ); / 24 8 / struct dsa_switch ds; /* 32 8 / unsigned int index; / 40 4 / enum { DSA_PORT_TYPE_UNUSED = 0, DSA_PORT_TYPE_CPU = 1, DSA_PORT_TYPE_DSA = 2, DSA_PORT_TYPE_USER = 3, } type; / 44 4 / const char name; /* 48 8 / struct dsa_port cpu_dp; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / u8 mac[6]; / 64 6 / u8 stp_state; / 70 1 / u8 vlan_filtering:1; / 71: 0 1 / u8 learning:1; / 71: 1 1 / u8 lag_tx_enabled:1; / 71: 2 1 / u8 devlink_port_setup:1; / 71: 3 1 / u8 setup:1; / 71: 4 1 / / XXX 3 bits hole, try to pack / struct device_node dn; /* 72 8 / unsigned int ageing_time; / 80 4 / / XXX 4 bytes hole, try to pack / struct dsa_bridge bridge; /* 88 8 / struct devlink_port devlink_port; / 96 288 / / --- cacheline 6 boundary (384 bytes) --- / struct phylink pl; /* 384 8 / struct phylink_config pl_config; / 392 40 / struct net_device lag_dev; /* 432 8 / struct net_device hsr_dev; /* 440 8 / / --- cacheline 7 boundary (448 bytes) --- / struct list_head list; / 448 16 / const struct ethtool_ops orig_ethtool_ops; /* 464 8 / const struct dsa_netdevice_ops netdev_ops; /* 472 8 / struct mutex addr_lists_lock; / 480 32 / / --- cacheline 8 boundary (512 bytes) --- / struct list_head fdbs; / 512 16 / struct list_head mdbs; / 528 16 / / size: 544, cachelines: 9, members: 30 / / sum members: 539, holes: 1, sum holes: 4 / / sum bitfield members: 5 bits, bit holes: 1, sum bit holes: 3 bits / / last cacheline: 32 bytes */ }; Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: dsa: merge all bools of struct dsa_port into a single u8	Vladimir Oltean	1	-7/+10
	struct dsa_port has 5 bool members which create quite a number of 7 byte holes in the structure layout. By merging them all into bitfields of an u8, and placing that u8 in the 1-byte hole after dp->mac and dp->stp_state, we can reduce the structure size from 576 bytes to 552 bytes on arm64. Before: pahole -C dsa_port net/dsa/slave.o struct dsa_port { union { struct net_device * master; /* 0 8 / struct net_device slave; /* 0 8 / }; / 0 8 / const struct dsa_device_ops tag_ops; /* 8 8 / struct dsa_switch_tree dst; /* 16 8 / struct sk_buff (rcv)(struct sk_buff , struct net_device ); / 24 8 / enum { DSA_PORT_TYPE_UNUSED = 0, DSA_PORT_TYPE_CPU = 1, DSA_PORT_TYPE_DSA = 2, DSA_PORT_TYPE_USER = 3, } type; / 32 4 / / XXX 4 bytes hole, try to pack / struct dsa_switch ds; /* 40 8 / unsigned int index; / 48 4 / / XXX 4 bytes hole, try to pack / const char name; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / struct dsa_port cpu_dp; /* 64 8 / u8 mac[6]; / 72 6 / u8 stp_state; / 78 1 / / XXX 1 byte hole, try to pack / struct device_node dn; /* 80 8 / unsigned int ageing_time; / 88 4 / bool vlan_filtering; / 92 1 / bool learning; / 93 1 / / XXX 2 bytes hole, try to pack / struct dsa_bridge bridge; /* 96 8 / struct devlink_port devlink_port; / 104 288 / / --- cacheline 6 boundary (384 bytes) was 8 bytes ago --- / bool devlink_port_setup; / 392 1 / / XXX 7 bytes hole, try to pack / struct phylink pl; /* 400 8 / struct phylink_config pl_config; / 408 40 / / --- cacheline 7 boundary (448 bytes) --- / struct net_device lag_dev; /* 448 8 / bool lag_tx_enabled; / 456 1 / / XXX 7 bytes hole, try to pack / struct net_device hsr_dev; /* 464 8 / struct list_head list; / 472 16 / const struct ethtool_ops orig_ethtool_ops; /* 488 8 / const struct dsa_netdevice_ops netdev_ops; /* 496 8 / struct mutex addr_lists_lock; / 504 32 / / --- cacheline 8 boundary (512 bytes) was 24 bytes ago --- / struct list_head fdbs; / 536 16 / struct list_head mdbs; / 552 16 / bool setup; / 568 1 / / size: 576, cachelines: 9, members: 30 / / sum members: 544, holes: 6, sum holes: 25 / / padding: 7 / }; After: pahole -C dsa_port net/dsa/slave.o struct dsa_port { union { struct net_device master; /* 0 8 / struct net_device slave; /* 0 8 / }; / 0 8 / const struct dsa_device_ops tag_ops; /* 8 8 / struct dsa_switch_tree dst; /* 16 8 / struct sk_buff (rcv)(struct sk_buff , struct net_device ); / 24 8 / enum { DSA_PORT_TYPE_UNUSED = 0, DSA_PORT_TYPE_CPU = 1, DSA_PORT_TYPE_DSA = 2, DSA_PORT_TYPE_USER = 3, } type; / 32 4 / / XXX 4 bytes hole, try to pack / struct dsa_switch ds; /* 40 8 / unsigned int index; / 48 4 / / XXX 4 bytes hole, try to pack / const char name; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / struct dsa_port cpu_dp; /* 64 8 / u8 mac[6]; / 72 6 / u8 stp_state; / 78 1 / u8 vlan_filtering:1; / 79: 0 1 / u8 learning:1; / 79: 1 1 / u8 lag_tx_enabled:1; / 79: 2 1 / u8 devlink_port_setup:1; / 79: 3 1 / u8 setup:1; / 79: 4 1 / / XXX 3 bits hole, try to pack / struct device_node dn; /* 80 8 / unsigned int ageing_time; / 88 4 / / XXX 4 bytes hole, try to pack / struct dsa_bridge bridge; /* 96 8 / struct devlink_port devlink_port; / 104 288 / / --- cacheline 6 boundary (384 bytes) was 8 bytes ago --- / struct phylink pl; /* 392 8 / struct phylink_config pl_config; / 400 40 / struct net_device lag_dev; /* 440 8 / / --- cacheline 7 boundary (448 bytes) --- / struct net_device hsr_dev; /* 448 8 / struct list_head list; / 456 16 / const struct ethtool_ops orig_ethtool_ops; /* 472 8 / const struct dsa_netdevice_ops netdev_ops; /* 480 8 / struct mutex addr_lists_lock; / 488 32 / / --- cacheline 8 boundary (512 bytes) was 8 bytes ago --- / struct list_head fdbs; / 520 16 / struct list_head mdbs; / 536 16 / / size: 552, cachelines: 9, members: 30 / / sum members: 539, holes: 3, sum holes: 12 / / sum bitfield members: 5 bits, bit holes: 1, sum bit holes: 3 bits / / last cacheline: 40 bytes */ }; Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: dsa: move dsa_port :: stp_state near dsa_port :: mac	Vladimir Oltean	1	-1/+3
	The MAC address of a port is 6 octets in size, and this creates a 2 octet hole after it. There are some other u8 members of struct dsa_port that we can put in that hole. One such member is the stp_state. Before: pahole -C dsa_port net/dsa/slave.o struct dsa_port { union { struct net_device * master; /* 0 8 / struct net_device slave; /* 0 8 / }; / 0 8 / const struct dsa_device_ops tag_ops; /* 8 8 / struct dsa_switch_tree dst; /* 16 8 / struct sk_buff (rcv)(struct sk_buff , struct net_device ); / 24 8 / enum { DSA_PORT_TYPE_UNUSED = 0, DSA_PORT_TYPE_CPU = 1, DSA_PORT_TYPE_DSA = 2, DSA_PORT_TYPE_USER = 3, } type; / 32 4 / / XXX 4 bytes hole, try to pack / struct dsa_switch ds; /* 40 8 / unsigned int index; / 48 4 / / XXX 4 bytes hole, try to pack / const char name; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / struct dsa_port cpu_dp; /* 64 8 / u8 mac[6]; / 72 6 / / XXX 2 bytes hole, try to pack / struct device_node dn; /* 80 8 / unsigned int ageing_time; / 88 4 / bool vlan_filtering; / 92 1 / bool learning; / 93 1 / u8 stp_state; / 94 1 / / XXX 1 byte hole, try to pack / struct dsa_bridge bridge; /* 96 8 / struct devlink_port devlink_port; / 104 288 / / --- cacheline 6 boundary (384 bytes) was 8 bytes ago --- / bool devlink_port_setup; / 392 1 / / XXX 7 bytes hole, try to pack / struct phylink pl; /* 400 8 / struct phylink_config pl_config; / 408 40 / / --- cacheline 7 boundary (448 bytes) --- / struct net_device lag_dev; /* 448 8 / bool lag_tx_enabled; / 456 1 / / XXX 7 bytes hole, try to pack / struct net_device hsr_dev; /* 464 8 / struct list_head list; / 472 16 / const struct ethtool_ops orig_ethtool_ops; /* 488 8 / const struct dsa_netdevice_ops netdev_ops; /* 496 8 / struct mutex addr_lists_lock; / 504 32 / / --- cacheline 8 boundary (512 bytes) was 24 bytes ago --- / struct list_head fdbs; / 536 16 / struct list_head mdbs; / 552 16 / bool setup; / 568 1 / / size: 576, cachelines: 9, members: 30 / / sum members: 544, holes: 6, sum holes: 25 / / padding: 7 / }; After: pahole -C dsa_port net/dsa/slave.o struct dsa_port { union { struct net_device master; /* 0 8 / struct net_device slave; /* 0 8 / }; / 0 8 / const struct dsa_device_ops tag_ops; /* 8 8 / struct dsa_switch_tree dst; /* 16 8 / struct sk_buff (rcv)(struct sk_buff , struct net_device ); / 24 8 / enum { DSA_PORT_TYPE_UNUSED = 0, DSA_PORT_TYPE_CPU = 1, DSA_PORT_TYPE_DSA = 2, DSA_PORT_TYPE_USER = 3, } type; / 32 4 / / XXX 4 bytes hole, try to pack / struct dsa_switch ds; /* 40 8 / unsigned int index; / 48 4 / / XXX 4 bytes hole, try to pack / const char name; /* 56 8 / / --- cacheline 1 boundary (64 bytes) --- / struct dsa_port cpu_dp; /* 64 8 / u8 mac[6]; / 72 6 / u8 stp_state; / 78 1 / / XXX 1 byte hole, try to pack / struct device_node dn; /* 80 8 / unsigned int ageing_time; / 88 4 / bool vlan_filtering; / 92 1 / bool learning; / 93 1 / / XXX 2 bytes hole, try to pack / struct dsa_bridge bridge; /* 96 8 / struct devlink_port devlink_port; / 104 288 / / --- cacheline 6 boundary (384 bytes) was 8 bytes ago --- / bool devlink_port_setup; / 392 1 / / XXX 7 bytes hole, try to pack / struct phylink pl; /* 400 8 / struct phylink_config pl_config; / 408 40 / / --- cacheline 7 boundary (448 bytes) --- / struct net_device lag_dev; /* 448 8 / bool lag_tx_enabled; / 456 1 / / XXX 7 bytes hole, try to pack / struct net_device hsr_dev; /* 464 8 / struct list_head list; / 472 16 / const struct ethtool_ops orig_ethtool_ops; /* 488 8 / const struct dsa_netdevice_ops netdev_ops; /* 496 8 / struct mutex addr_lists_lock; / 504 32 / / --- cacheline 8 boundary (512 bytes) was 24 bytes ago --- / struct list_head fdbs; / 536 16 / struct list_head mdbs; / 552 16 / bool setup; / 568 1 / / size: 576, cachelines: 9, members: 30 / / sum members: 544, holes: 6, sum holes: 25 / / padding: 7 */ }; Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: create new common cmd code for PF and VF modules	Jie Wang	7	-329/+279
	Currently PF and VF use two sets of command code for modules to interact with firmware. These codes values are same espect the macro names. It is redundent to keep two sets of command code for same functions between PF and VF. So this patch firstly creates a unified command code for PF and VF module. We keep the macro name same with the PF command code name to avoid too many meaningless modifications. Secondly the new common command codes are used to replace the old ones in VF and deletes the old ones. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: refactor VF tqp stats APIs with new common tqp stats APIs	Jie Wang	2	-134/+11
	This patch firstly uses new tqp struct(hclge_comm_tqp) and removes the old VF tqp struct(hclgevf_tqp). All the tqp stats members used in VF module are modified according to the new hclge_comm_tqp. Secondly VF tqp stats APIs are refactored to use new common tqp stats APIs. The old tqp stats APIs in VF are deleted. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: refactor PF tqp stats APIs with new common tqp stats APIs	Jie Wang	2	-156/+17
	This patch firstly uses new tqp struct(hclge_comm_tqp) and deletes the old PF tqp struct(hclge_tqp). All the tqp stats members used in PF module are modified according to the new hclge_comm_tqp. Secondly PF tqp stats APIs are refactored to use new common tqp stats APIs. The old tqp stats APIs in PF are deleted. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: create new set of common tqp stats APIs for PF and VF reuse	Jie Wang	4	-2/+161
	This patch creates new set of common tqp stats structures and APIs for PF and VF tqp stats module. Subfunctions such as get tqp stats, update tqp stats and reset tqp stats are inclued in this patch. These new common tqp stats APIs will be used to replace the old PF and VF tqp stats APIs in next patches. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: refactor VF rss init APIs with new common rss init APIs	Jie Wang	2	-176/+25
	This patch uses common rss init APIs to replace the old APIs in VF rss module and removes the old VF rss init APIs. Several related Subfunctions and macros are also modified in this patch. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: refactor PF rss init APIs with new common rss init APIs	Jie Wang	2	-131/+11
	This patch uses common rss init APIs to replace the old APIs in PF rss module and deletes the old PF rss init APIs. Some related subfunctions and macros are also modified in this patch. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: create new set of common rss init APIs for PF and VF reuse	Jie Wang	3	-5/+213
	This patch creates new set of common rss init APIs for PF and VF rss module. Subfunctions called by rss init process are also created include rss tuple configuration and rss indirect table configuration. These new common rss init APIs will be used to replace the old PF and VF rss init APIs in next patches. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: refactor VF rss set APIs with new common rss set APIs	Jie Wang	3	-293/+42
	This patch uses new common rss set APIs to replace the old APIs in VF rss module and removes those old rss set APIs. The related macros in VF are also modified. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: refactor PF rss set APIs with new common rss set APIs	Jie Wang	3	-298/+36
	This patch uses new common rss set APIs to replace the old APIs in PF rss module and deletes the old rss set APIs. The related macros are also modified. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: create new set of common rss set APIs for PF and VF module	Jie Wang	3	-1/+230
	Currently, hns3 PF and VF rss module have two sets of rss set APIs to configure rss. There is no need to keep two sets of these same APIs. So this patch creates new set of common rss set APIs for PF and VF reuse. These new APIs will be used to unify old APIs in next patches. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: refactor VF rss get APIs with new common rss get APIs	Jie Wang	2	-99/+24
	This patch firstly uses new rss parameter struct(hclge_comm_rss_cfg) as child member of hclgevf_dev and deletes the original child rss parameter member(hclgevf_rss_cfg). All the rss parameter members used in VF rss module is modified according to the new hclge_comm_rss_cfg. Secondly VF rss get APIs are refactored to use new common rss get APIs. The old rss get APIs in VF are deleted. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: refactor PF rss get APIs with new common rss get APIs	Jie Wang	5	-160/+67
	This patch firstly uses new rss parameter struct(hclge_comm_rss_cfg) as child member of hclge_dev and deletes the original child rss parameter members in vport. All the vport child rss parameter members used in PF rss module is modified according to the new hclge_comm_rss_cfg. Secondly PF rss get APIs are refactored to use new common rss get APIs. The old rss get APIs in PF are deleted. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2022-01-05	net: hns3: create new set of common rss get APIs for PF and VF rss module	Jie Wang	3	-2/+197
	The PF and VF rss get APIs are almost the same espect the suffixes of API names. These same impementions bring double development and bugfix work. So this patch creates new common rss get APIs for PF and VF rss module. Subfunctions called by rss query process are also created(e.g. rss tuple conversion APIs). These new common rss get APIs will be used to replace PF and VF old rss APIs in next patches. Signed-off-by: Jie Wang <wangjie125@huawei.com> Signed-off-by: David S. Miller <davem@davemloft.net>