[2/3] batman-adv: additional checks for virtual interfaces on top of WiFi

Message ID 1468314486-29592-2-git-send-email-mareklindner@neomailbox.ch (mailing list archive)
State Superseded, archived
Delegated to: Marek Lindner
Headers

Commit Message

Marek Lindner July 12, 2016, 9:08 a.m. UTC
  In a few situations batman-adv tries to determine whether a given interface
is a WiFi interface to enable specific WiFi optimizations. If the interface
batman-adv has been configured with is a virtual interface (e.g. VLAN) it
would not be properly detected as WiFi interface and thus not benefit from
the special WiFi treatment.
This patch changes that by peeking under the hood whenever a virtual
interface is in play.

Signed-off-by: Marek Lindner <mareklindner@neomailbox.ch>
---
 net/batman-adv/hard-interface.c | 85 ++++++++++++++++++++++++++++++++++++++---
 net/batman-adv/hard-interface.h |  1 +
 2 files changed, 81 insertions(+), 5 deletions(-)
  

Comments

Sven Eckelmann July 12, 2016, 12:33 p.m. UTC | #1
On Dienstag, 12. Juli 2016 17:08:05 CEST Marek Lindner wrote:
> In a few situations batman-adv tries to determine whether a given interface
> is a WiFi interface to enable specific WiFi optimizations. If the interface
> batman-adv has been configured with is a virtual interface (e.g. VLAN) it
> would not be properly detected as WiFi interface and thus not benefit from
> the special WiFi treatment.
> This patch changes that by peeking under the hood whenever a virtual
> interface is in play.
> 
> Signed-off-by: Marek Lindner <mareklindner@neomailbox.ch>
> ---
>  net/batman-adv/hard-interface.c | 85 ++++++++++++++++++++++++++++++++++++++---
>  net/batman-adv/hard-interface.h |  1 +
>  2 files changed, 81 insertions(+), 5 deletions(-)

Most of the patch looks good. But you may want to first accept the netlink
patches and the rewrite the dev_get_link part.

> diff --git a/net/batman-adv/hard-interface.c b/net/batman-adv/hard-interface.c
> index 478977b..6324474 100644
> --- a/net/batman-adv/hard-interface.c
> +++ b/net/batman-adv/hard-interface.c
[...]
> +struct net_device *batadv_get_real_netdev(struct net_device *net_device)
> +{
[...]
> +	net = dev_net(hard_iface->soft_iface);
> +	ifindex = dev_get_iflink(net_device);
> +	real_netdev = dev_get_by_index(net, ifindex);

Andrew provided a patch [1] which gets the correct namespace of a link. You
should consider to first accept his patch and then use batadv_getlink_net to
get the correct namespace for the device instead of just assuming that it will
be in the same netns as the batman-adv interface.

Something like this:

       net = dev_net(hard_iface->soft_iface);
       ifindex = dev_get_iflink(net_device);
       real_net = = batadv_getlink_net(net_device, net);
       real_netdev = dev_get_by_index(real_net, ifindex);

Could it also be that this has to be done with the rtnl lock held? Otherwise
some of the the information may change while we are going through all the
steps to get the real interface.

Kind regards,
	Sven

[1] https://patchwork.open-mesh.org/patch/16442/
  
Sven Eckelmann July 12, 2016, 3:50 p.m. UTC | #2
On Dienstag, 12. Juli 2016 14:33:09 CEST Sven Eckelmann wrote:
[....]
> > diff --git a/net/batman-adv/hard-interface.c b/net/batman-adv/hard-
interface.c
> > index 478977b..6324474 100644
> > --- a/net/batman-adv/hard-interface.c
> > +++ b/net/batman-adv/hard-interface.c
> [...]
> > +struct net_device *batadv_get_real_netdev(struct net_device *net_device)
> > +{
> [...]
> > +	net = dev_net(hard_iface->soft_iface);
> > +	ifindex = dev_get_iflink(net_device);
> > +	real_netdev = dev_get_by_index(net, ifindex);
> 
> Andrew provided a patch [1] which gets the correct namespace of a link. You
> should consider to first accept his patch and then use batadv_getlink_net to
> get the correct namespace for the device instead of just assuming that it 
will
> be in the same netns as the batman-adv interface.
> 
> Something like this:
> 
>        net = dev_net(hard_iface->soft_iface);
>        ifindex = dev_get_iflink(net_device);
>        real_net = = batadv_getlink_net(net_device, net);
>        real_netdev = dev_get_by_index(real_net, ifindex);
> 
> Could it also be that this has to be done with the rtnl lock held? Otherwise
> some of the the information may change while we are going through all the
> steps to get the real interface.

Maybe you can add ASSERT_RTNL(); in this function. The caller of these
functions (see patch 3) have to make sure that they take the rtnl_lock().

batadv_is_cfg80211_netdev could get the rtnl_lock because it is used for the
elp code (which isn't inside the rtnl_lock, right?). But the call to
batadv_get_real_netdev in batadv_v_elp_get_throughput from Patch 3 still has
to be protected.

Or you could create an extra function which takes care of getting the real 
device for batadv_v_elp_get_throughput when it is a cfg80211 based one 
(otherwise returning NULL). This function can take care of getting the lock. 
Then you can also drop this
batadv_is_cfg80211_netdev -> _batadv_is_cfg80211_netdev change. Of course, you 
have to re-arrange many things in your patchset.

1. create batadv_is_cfg80211_netdev
2. introduce function that returns the device when it is a cfg80211
   * make use of it in batadv_v_elp_get_throughput and use it instead of
     batadv_is_cfg80211_netdev
3. introduce batadv_get_real_netdev
   * make use of it in your newly created function of patch 2
   * don't use batadv_get_real_netdev directly in batadv_v_elp_get_throughput

Kind regards,
	Sven
  
Sven Eckelmann Sept. 30, 2016, 10:31 a.m. UTC | #3
On Dienstag, 12. Juli 2016 17:50:27 CEST Sven Eckelmann wrote:
> On Dienstag, 12. Juli 2016 14:33:09 CEST Sven Eckelmann wrote:
> [....]
> 
> > > diff --git a/net/batman-adv/hard-interface.c b/net/batman-adv/hard-
> 
> interface.c
> 
> > > index 478977b..6324474 100644
> > > --- a/net/batman-adv/hard-interface.c
> > > +++ b/net/batman-adv/hard-interface.c
> > 
> > [...]
> > 
> > > +struct net_device *batadv_get_real_netdev(struct net_device
> > > *net_device)
> > > +{
> > 
> > [...]
> > 
> > > +	net = dev_net(hard_iface->soft_iface);
> > > +	ifindex = dev_get_iflink(net_device);
> > > +	real_netdev = dev_get_by_index(net, ifindex);
> > 
> > Andrew provided a patch [1] which gets the correct namespace of a link.
> > You
> > should consider to first accept his patch and then use batadv_getlink_net
> > to get the correct namespace for the device instead of just assuming that
> > it
> will
> 
> > be in the same netns as the batman-adv interface.
> > 
> > Something like this:
> >        net = dev_net(hard_iface->soft_iface);
> >        ifindex = dev_get_iflink(net_device);
> >        real_net = = batadv_getlink_net(net_device, net);
> >        real_netdev = dev_get_by_index(real_net, ifindex);
> > 
> > Could it also be that this has to be done with the rtnl lock held?
> > Otherwise some of the the information may change while we are going
> > through all the steps to get the real interface.
> 
> Maybe you can add ASSERT_RTNL(); in this function. The caller of these
> functions (see patch 3) have to make sure that they take the rtnl_lock().
> 
> batadv_is_cfg80211_netdev could get the rtnl_lock because it is used for the
> elp code (which isn't inside the rtnl_lock, right?). But the call to
> batadv_get_real_netdev in batadv_v_elp_get_throughput from Patch 3 still
> has to be protected.

This one failed because the rest of the code is rtnl-wise a complete mess. TT
code sometimes is in rtnl-locked context and sometimes not. But it always
wants to know if it is a wifi device. Same for multicast code which accesses
tt.

The rest of the code could be splitted to use two different functions but at
the end we even have problems that we cannot take the rtnl semaphore because
some other lock is enabled:


    [   34.023637] BUG: scheduling while atomic: kworker/u2:2/254/0x00000200
    [   34.030304] Modules linked in: pppoe ppp_async iptable_nat ath9k pppox ppp_generic nf_nat_ipv4 nf_conntrack_ipv6 nf_conntrack_ipv4 ipt_REJECT ipt_MASQUERADE ath9k_common xt_time xt_tcpudp xt_state xt_nat xt_multiport xt_mark xt_mac xt_limit xt_id xt_conntrack xt_comment xt_TCPMSS xt_REDIRECT xt_LOG xt_CT slhc nf_reject_ipv4 nf_nat_redirect nf_nat_masquerade_ipv4 nf_nat nf_log_ipv4 nf_defrag_ipv6 nf_defrag_ipv4 nf_conntrack_rtcache nf_conntrack iptable_raw iptable_mangle iptable_filter ip_tables crc_ccitt ath9k_hw act_skbedit act_mirred em_u32 cls_u32 cls_tcindex cls_flow cls_route cls_fw sch_hfsc sch_ingress ath10k_pci ath10k_core mac80211 ath batman_adv libcrc32c cfg80211 compat ip6t_REJECT nf_reject_ipv6 nf_log_ipv6 nf_log_common ip6table_raw ip6table_mangle ip6table_filter ip6_tables x_tables gpio_button_hotplug crc16 crc32c_generic crypto_hash
    [   34.108504] CPU: 0 PID: 254 Comm: kworker/u2:2 Tainted: G        W       4.4.21 #0
    [   34.116399] Workqueue: bat_events batadv_v_elp_packet_recv [batman_adv]
    [   34.123232] Stack : 839c2c20 00000000 82c07780 800a7168 83a05680 8040cd83 803a632c 000000fe
              803d0950 83a19b9c 80410000 800a50e4 82c07780 800a7168 803ab9c8 80410000
              00000003 83a19b9c 80410000 80095154 82c07780 83a19bd4 00000000 801f26d0
              8040be90 801f2600 830c58f4 83b5bf00 83b5b900 6261745f 6576656e 74730000
              00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
              ...
    [   34.160255] Call Trace:
    [   34.162795] [<80071c6c>] show_stack+0x50/0x84
    [   34.167314] [<8009b350>] __schedule_bug+0x44/0x60
    [   34.172173] [<80066770>] __schedule+0x64/0x608
    [   34.176776] [<80066d78>] schedule+0x64/0x7c
    [   34.181103] [<80066f48>] schedule_preempt_disabled+0x10/0x1c
    [   34.186957] [<80068244>] __mutex_lock_slowpath+0xb8/0x114
    [   34.192565] [<830ce03c>] batadv_is_wifi_netdev+0x14/0x38 [batman_adv]
    [   34.199274] [<830de7d4>] batadv_tt_local_add+0x1ec/0x850 [batman_adv]
    [   34.205985] [<830d0f00>] batadv_mcast_mla_update+0x4c8/0x5a4 [batman_adv]
    [   34.213043] [<830de00c>] batadv_tp_meter_init+0x15b4/0x1b4c [batman_adv]
    [   34.219988] 
    [   44.494164] random: nonblocking pool is initialized

I have just pushed the code to ecsv/wifi-master-failed. Just in case someone
is interested in working everything out.

> Or you could create an extra function which takes care of getting the real
> device for batadv_v_elp_get_throughput when it is a cfg80211 based one
> (otherwise returning NULL). This function can take care of getting the lock.
> Then you can also drop this
> batadv_is_cfg80211_netdev -> _batadv_is_cfg80211_netdev change. Of course,
> you have to re-arrange many things in your patchset.
> 
> 1. create batadv_is_cfg80211_netdev
> 2. introduce function that returns the device when it is a cfg80211
>    * make use of it in batadv_v_elp_get_throughput and use it instead of
>      batadv_is_cfg80211_netdev
> 3. introduce batadv_get_real_netdev
>    * make use of it in your newly created function of patch 2
>    * don't use batadv_get_real_netdev directly in
> batadv_v_elp_get_throughput

This idea has basically the same problem because we still would need to run
the batadv_is_wifi_netdev code.

Kind regards,
	Sven
  

Patch

diff --git a/net/batman-adv/hard-interface.c b/net/batman-adv/hard-interface.c
index 478977b..6324474 100644
--- a/net/batman-adv/hard-interface.c
+++ b/net/batman-adv/hard-interface.c
@@ -166,14 +166,51 @@  static bool batadv_is_valid_iface(const struct net_device *net_dev)
 }
 
 /**
- * batadv_is_cfg80211_netdev - check if the given net_device struct is a
- *  cfg80211 wifi interface
+ * batadv_get_real_netdev - check if the given net_device struct is a virtual
+ *  interface on top of another 'real' interface
+ * @net_device: the device to check
+ *
+ * Return: the 'real' net device or the original net device and NULL in case
+ *  of an error.
+ */
+struct net_device *batadv_get_real_netdev(struct net_device *net_device)
+{
+	struct batadv_hard_iface *hard_iface = NULL;
+	struct net_device *real_netdev = NULL;
+	struct net *net;
+	int ifindex;
+
+	if (!net_device)
+		return NULL;
+
+	if (net_device->ifindex == dev_get_iflink(net_device)) {
+		dev_hold(net_device);
+		return net_device;
+	}
+
+	hard_iface = batadv_hardif_get_by_netdev(net_device);
+	if (!hard_iface || !hard_iface->soft_iface)
+		goto out;
+
+	net = dev_net(hard_iface->soft_iface);
+	ifindex = dev_get_iflink(net_device);
+	real_netdev = dev_get_by_index(net, ifindex);
+
+out:
+	if (hard_iface)
+		batadv_hardif_put(hard_iface);
+	return real_netdev;
+}
+
+/**
+ * _batadv_is_cfg80211_netdev - check if the given net_device struct is a
+ *  cfg80211 wifi interface (without real netdev check)
  * @net_device: the device to check
  *
  * Return: true if the net device is a cfg80211 wireless device, false
  *  otherwise.
  */
-bool batadv_is_cfg80211_netdev(struct net_device *net_device)
+static bool _batadv_is_cfg80211_netdev(struct net_device *net_device)
 {
 	if (!net_device)
 		return false;
@@ -186,6 +223,32 @@  bool batadv_is_cfg80211_netdev(struct net_device *net_device)
 }
 
 /**
+ * batadv_is_cfg80211_netdev - check if the given net_device struct is a
+ *  cfg80211 wifi interface
+ * @net_device: the device to check
+ *
+ * Return: true if the net device is a cfg80211 wireless device, false
+ *  otherwise.
+ */
+bool batadv_is_cfg80211_netdev(struct net_device *net_device)
+{
+	struct net_device *real_netdev;
+	bool ret;
+
+	if (!net_device)
+		return false;
+
+	real_netdev = batadv_get_real_netdev(net_device);
+	if (!real_netdev)
+		return false;
+
+	ret = _batadv_is_cfg80211_netdev(real_netdev);
+
+	dev_put(real_netdev);
+	return ret;
+}
+
+/**
  * batadv_is_wifi_netdev - check if the given net_device struct is a wifi
  *  interface
  * @net_device: the device to check
@@ -194,18 +257,30 @@  bool batadv_is_cfg80211_netdev(struct net_device *net_device)
  */
 bool batadv_is_wifi_netdev(struct net_device *net_device)
 {
+	struct net_device *real_netdev;
+	bool ret;
+
 	if (!net_device)
 		return false;
 
+	real_netdev = batadv_get_real_netdev(net_device);
+	if (!real_netdev)
+		return false;
+
 #ifdef CONFIG_WIRELESS_EXT
 	/* pre-cfg80211 drivers have to implement WEXT, so it is possible to
 	 * check for wireless_handlers != NULL
 	 */
-	if (net_device->wireless_handlers)
+	if (real_netdev->wireless_handlers) {
+		dev_put(real_netdev);
 		return true;
+	}
 #endif
 
-	return batadv_is_cfg80211_netdev(net_device);
+	ret = _batadv_is_cfg80211_netdev(real_netdev);
+
+	dev_put(real_netdev);
+	return ret;
 }
 
 static struct batadv_hard_iface *
diff --git a/net/batman-adv/hard-interface.h b/net/batman-adv/hard-interface.h
index e0893de..a14316d 100644
--- a/net/batman-adv/hard-interface.h
+++ b/net/batman-adv/hard-interface.h
@@ -51,6 +51,7 @@  enum batadv_hard_if_cleanup {
 
 extern struct notifier_block batadv_hard_if_notifier;
 
+struct net_device *batadv_get_real_netdev(struct net_device *net_device);
 bool batadv_is_cfg80211_netdev(struct net_device *net_device);
 bool batadv_is_wifi_netdev(struct net_device *net_device);
 bool batadv_is_wifi_iface(int ifindex);