[1/3] batman-adv: fix lockdep splat when doing mcast_free

Message ID 1448305042-5806-2-git-send-email-sw@simonwunderlich.de (mailing list archive)
State Superseded, archived
Commit 025b743f6474bfdcd765a16ec6b098d06a40bf87
Headers

Commit Message

Simon Wunderlich Nov. 23, 2015, 6:57 p.m. UTC
  From: Simon Wunderlich <simon@open-mesh.com>

Signed-off-by: Simon Wunderlich <simon@open-mesh.com>
---
 net/batman-adv/multicast.c | 2 ++
 1 file changed, 2 insertions(+)
  

Comments

Antonio Quartulli Nov. 28, 2015, 2:49 a.m. UTC | #1
Simon Wunderlich:
> From: Simon Wunderlich <simon@open-mesh.com>
> 
> Signed-off-by: Simon Wunderlich <simon@open-mesh.com>
> ---
>  net/batman-adv/multicast.c | 2 ++
>  1 file changed, 2 insertions(+)
> 
> diff --git a/net/batman-adv/multicast.c b/net/batman-adv/multicast.c
> index 8abf488..d984eee 100644
> --- a/net/batman-adv/multicast.c
> +++ b/net/batman-adv/multicast.c
> @@ -801,7 +801,9 @@ void batadv_mcast_free(struct batadv_priv *bat_priv)
>  	batadv_tvlv_container_unregister(bat_priv, BATADV_TVLV_MCAST, 1);
>  	batadv_tvlv_handler_unregister(bat_priv, BATADV_TVLV_MCAST, 1);
>  
> +	spin_lock_bh(&bat_priv->tt.commit_lock);
>  	batadv_mcast_mla_tt_retract(bat_priv, NULL);
> +	spin_unlock_bh(&bat_priv->tt.commit_lock);

Linus,

can you please comment as of why batadv_mcast_mla_tt_retract() requires
to hold the tt.commit_lock ?


I see it calls batadv_tt_local_remove() but this does not really
requires the lock. Maybe you wanted to perform *all* the removes before
TT could do a commit ? Or is there any other reason?

Cheers,
  
Sven Eckelmann Nov. 28, 2015, 8:21 a.m. UTC | #2
On Saturday 28 November 2015 10:49:59 Antonio Quartulli wrote:
> Simon Wunderlich:
> > From: Simon Wunderlich <simon@open-mesh.com>
> > 
> > Signed-off-by: Simon Wunderlich <simon@open-mesh.com>
> > ---
> > 
> >  net/batman-adv/multicast.c | 2 ++
> >  1 file changed, 2 insertions(+)
> > 
> > diff --git a/net/batman-adv/multicast.c b/net/batman-adv/multicast.c
> > index 8abf488..d984eee 100644
> > --- a/net/batman-adv/multicast.c
> > +++ b/net/batman-adv/multicast.c
> > @@ -801,7 +801,9 @@ void batadv_mcast_free(struct batadv_priv *bat_priv)
> > 
> >  	batadv_tvlv_container_unregister(bat_priv, BATADV_TVLV_MCAST, 1);
> >  	batadv_tvlv_handler_unregister(bat_priv, BATADV_TVLV_MCAST, 1);
> > 
> > +	spin_lock_bh(&bat_priv->tt.commit_lock);
> > 
> >  	batadv_mcast_mla_tt_retract(bat_priv, NULL);
> > 
> > +	spin_unlock_bh(&bat_priv->tt.commit_lock);
> 
> Linus,
> 
> can you please comment as of why batadv_mcast_mla_tt_retract() requires
> to hold the tt.commit_lock ?

Maybe the relevant parts of the Oops can be added to the commit message:

    WARNING: CPU: 0 PID: 238 at net/batman-adv/multicast.c:142 batadv_mcast_mla_tt_retract+0x94/0x205 [batman_adv]()
    [...]
    Call Trace:
     [<ffffffff815fc597>] dump_stack+0x4b/0x64
     [<ffffffff810b34dc>] warn_slowpath_common+0xbc/0x120
     [<ffffffffa0024ec5>] ? batadv_mcast_mla_tt_retract+0x94/0x205 [batman_adv]
     [<ffffffff810b3705>] warn_slowpath_null+0x15/0x20
     [<ffffffffa0024ec5>] batadv_mcast_mla_tt_retract+0x94/0x205 [batman_adv]
     [<ffffffffa00273fe>] batadv_mcast_free+0x36/0x39 [batman_adv]
     [<ffffffffa0020c77>] batadv_mesh_free+0x7d/0x13f [batman_adv]
     [<ffffffffa0036a6b>] batadv_softif_free+0x15/0x25 [batman_adv]
     [...]

> I see it calls batadv_tt_local_remove() but this does not really
> requires the lock. Maybe you wanted to perform *all* the removes before
> TT could do a commit ? Or is there any other reason?
> 
> Cheers,

mcast.mla_list is protected by tt.commit_lock (see batadv_mcast_mla_tt_add,
batadv_mcast_mla_list_free and batadv_mcast_mla_tt_retract).

Kind regards,
	Sven
  
Antonio Quartulli Nov. 28, 2015, 12:56 p.m. UTC | #3
Sven Eckelmann:
> Maybe the relevant parts of the Oops can be added to the commit message:
> 
>     WARNING: CPU: 0 PID: 238 at net/batman-adv/multicast.c:142 batadv_mcast_mla_tt_retract+0x94/0x205 [batman_adv]()
>     [...]
>     Call Trace:
>      [<ffffffff815fc597>] dump_stack+0x4b/0x64
>      [<ffffffff810b34dc>] warn_slowpath_common+0xbc/0x120
>      [<ffffffffa0024ec5>] ? batadv_mcast_mla_tt_retract+0x94/0x205 [batman_adv]
>      [<ffffffff810b3705>] warn_slowpath_null+0x15/0x20
>      [<ffffffffa0024ec5>] batadv_mcast_mla_tt_retract+0x94/0x205 [batman_adv]
>      [<ffffffffa00273fe>] batadv_mcast_free+0x36/0x39 [batman_adv]
>      [<ffffffffa0020c77>] batadv_mesh_free+0x7d/0x13f [batman_adv]
>      [<ffffffffa0036a6b>] batadv_softif_free+0x15/0x25 [batman_adv]
>      [...]


I agree

> 
>> I see it calls batadv_tt_local_remove() but this does not really
>> requires the lock. Maybe you wanted to perform *all* the removes before
>> TT could do a commit ? Or is there any other reason?
>>
>> Cheers,
> 
> mcast.mla_list is protected by tt.commit_lock (see batadv_mcast_mla_tt_add,
> batadv_mcast_mla_list_free and batadv_mcast_mla_tt_retract).
> 

ok, this makes sense :)

Simon, I'd suggest you follow Sven's suggestion about adding the
oops/stacktrace and then you can also append my

Acked-by: Antonio Quartulli <antonio@meshcoding.com>

note that this patch is a bugfix, hence it should target maint.

Cheers,
  
Linus Lüssing Dec. 7, 2015, 10:12 p.m. UTC | #4
Hi,

Sorry for the late reply, I had also missed the Sven's
lockdep-assert-patch back then.

> On Saturday 28 November 2015 10:49:59 Antonio Quartulli wrote:
> > Linus,
> > 
> > can you please comment as of why batadv_mcast_mla_tt_retract() requires
> > to hold the tt.commit_lock ?

I don't think it does. At least if you say that a call to
batadv_tt_local_remove() as is does not need it (and it seems
there are other places calling tt_local_remove() without this
lock, too).

> [...]

On Sat, Nov 28, 2015 at 09:21:02AM +0100, Sven Eckelmann wrote:
> mcast.mla_list is protected by tt.commit_lock (see batadv_mcast_mla_tt_add,
> batadv_mcast_mla_list_free and batadv_mcast_mla_tt_retract).
> 

mcast.mla_list changes should be protected by the non-parallel code
flow: During runtime, batadv_mcast_mla_tt_update() is only called from
the self-rearming OGM scheduler thread -
batadv_mcast_mla_tt_update() will never run more than once at the
same time.

The second place for mcast.mla_list changes, batadv_mcast_free(), is
called only on shutdown after the OGM scheduling thread was stopped.


I don't think there should be such races regarding mcast.mla_list
- was something like that observed in the wild which lead to inserting
the lockdep-asserts?

Cheers, Linus
  
Linus Lüssing Dec. 7, 2015, 10:36 p.m. UTC | #5
PS: But granted, even if we came to the conclusion, that a
tt.commit_lock were unnecessary from the multicast code,
that would not be very obvious. Something needs to be done
either way.
  
Sven Eckelmann Dec. 14, 2015, 6:56 p.m. UTC | #6
On Monday 07 December 2015 23:12:42 Linus Lüssing wrote:
> On Sat, Nov 28, 2015 at 09:21:02AM +0100, Sven Eckelmann wrote:
> > mcast.mla_list is protected by tt.commit_lock (see
> > batadv_mcast_mla_tt_add,
> > batadv_mcast_mla_list_free and batadv_mcast_mla_tt_retract).
> 
> mcast.mla_list changes should be protected by the non-parallel code
> flow: During runtime, batadv_mcast_mla_tt_update() is only called from
> the self-rearming OGM scheduler thread -
> batadv_mcast_mla_tt_update() will never run more than once at the
> same time.
> 
> The second place for mcast.mla_list changes, batadv_mcast_free(), is
> called only on shutdown after the OGM scheduling thread was stopped.

The two functions with the lockdep assert are

* batadv_mcast_mla_list_free
* batadv_mcast_mla_tt_retract

(batadv_mcast_mla_tt_add looks basically like batadv_mcast_mla_list_free)

The call graphs are attached and these graphs have (pure) starting nodes which 
are not only batadv_exit and batadv_iv_ogm_schedule. Parts of them look like 
they are only protected because of tt.commit_lock.

> I don't think there should be such races regarding mcast.mla_list
> - was something like that observed in the wild which lead to inserting
> the lockdep-asserts?

We had multiple races and crashes which resulted in these asserts and 
locks+checks around list_del. The list_add modifications are still missing. 
And there are still other problems which are still open [1].

Kind regards,
	Sven

[1] e.g. https://www.open-mesh.org/issues/223
  
Linus Lüssing Dec. 15, 2015, 1:15 p.m. UTC | #7
On Mon, Dec 14, 2015 at 07:56:19PM +0100, Sven Eckelmann wrote:
> On Monday 07 December 2015 23:12:42 Linus Lüssing wrote:
> > On Sat, Nov 28, 2015 at 09:21:02AM +0100, Sven Eckelmann wrote:
> > > mcast.mla_list is protected by tt.commit_lock (see
> > > batadv_mcast_mla_tt_add,
> > > batadv_mcast_mla_list_free and batadv_mcast_mla_tt_retract).
> > 
> > mcast.mla_list changes should be protected by the non-parallel code
> > flow: During runtime, batadv_mcast_mla_tt_update() is only called from
> > the self-rearming OGM scheduler thread -
> > batadv_mcast_mla_tt_update() will never run more than once at the
> > same time.
> > 
> > The second place for mcast.mla_list changes, batadv_mcast_free(), is
> > called only on shutdown after the OGM scheduling thread was stopped.
> 
> The two functions with the lockdep assert are
> 
> * batadv_mcast_mla_list_free
> * batadv_mcast_mla_tt_retract
> 
> (batadv_mcast_mla_tt_add looks basically like batadv_mcast_mla_list_free)
> 
> The call graphs are attached and these graphs have (pure) starting nodes which 
> are not only batadv_exit and batadv_iv_ogm_schedule. Parts of them look like 
> they are only protected because of tt.commit_lock.

Thanks for these pictures! (btw. which tool did you use for that?)

The two non-colliding paths I had in mind were
batadv_iv_ogm_schedule() and batadv_mcast_free(), which looks
like:

batadv_mesh_free()
	-> batadv_purge_outstanding_packets()
		-> cancel_delayed_work_sync()	!
	[...]
	-> batadv_mcast_free()

Which ensures that no batadv_iv_ogm_schedule() is running before
calling batadv_mcast_free().


But I missed the path from batadv_update_min_mtu()... However,
it should not race with batadv_mcast_free() either, which is
called once the last hard-iface gets disabled:

batadv_hardif_disable_interface()
	-> batadv_purge_outstanding_packets()
		-> cancel_delayed_work_sync()	!
	-> dev_put(soft_iface)
		[ if last hard-iface, then soft-iface is out
		  of scope for any new batadv_update_mtu() and
		  gets freed: ]
		-> batadv_softif_free()
			-> batadv_mesh_free()
				-> batadv_mcast_free()

But with yet another path it is getting even more, rediculously
complicated... Just proving that trying to keep a lock-less update
for mla_list here is a bad, unmaintainable approach :).

So I'm definitely in favor of having some spinlock to refer to for
mcast.mla_list update, even if it isn't necessary for
batadv_mcast_free(). But I don't see a race for the current
mla_list updates either (otherwise I'd need a specific, more
verbose example, I guess...).


The question is, do we want to have Simon's patch for maint to
trickle down to 4.3 (where the lockdep patch got added) or back to
3.15 (where multicast.c got added)?

For stable kernels, we need a specific, precise, reproducable issue
to point to, right? (stable_kernel_rules.txt: 'It must fix a real
bug that bothers people (not a, "This could be a problem..." type
thing).'

Regards, Linus
  
Sven Eckelmann Dec. 15, 2015, 2:15 p.m. UTC | #8
On Tuesday 15 December 2015 14:15:33 Linus Lüssing wrote:
> On Mon, Dec 14, 2015 at 07:56:19PM +0100, Sven Eckelmann wrote:
> > On Monday 07 December 2015 23:12:42 Linus Lüssing wrote:
> > > On Sat, Nov 28, 2015 at 09:21:02AM +0100, Sven Eckelmann wrote:
> > > > mcast.mla_list is protected by tt.commit_lock (see
> > > > batadv_mcast_mla_tt_add,
> > > > batadv_mcast_mla_list_free and batadv_mcast_mla_tt_retract).
> > > 
> > > mcast.mla_list changes should be protected by the non-parallel code
> > > flow: During runtime, batadv_mcast_mla_tt_update() is only called from
> > > the self-rearming OGM scheduler thread -
> > > batadv_mcast_mla_tt_update() will never run more than once at the
> > > same time.
> > > 
> > > The second place for mcast.mla_list changes, batadv_mcast_free(), is
> > > called only on shutdown after the OGM scheduling thread was stopped.
> > 
> > The two functions with the lockdep assert are
> > 
> > * batadv_mcast_mla_list_free
> > * batadv_mcast_mla_tt_retract
> > 
> > (batadv_mcast_mla_tt_add looks basically like batadv_mcast_mla_list_free)
> > 
> > The call graphs are attached and these graphs have (pure) starting nodes
> > which are not only batadv_exit and batadv_iv_ogm_schedule. Parts of them
> > look like they are only protected because of tt.commit_lock.
> 
> Thanks for these pictures! (btw. which tool did you use for that?)

It is just doxygen with the dot (graphviz) enabled for all (also undocumented 
+ local/static functions).

> 
> The two non-colliding paths I had in mind were
> batadv_iv_ogm_schedule() and batadv_mcast_free(), which looks
> like:
> 
> batadv_mesh_free()
> 	-> batadv_purge_outstanding_packets()
> 		-> cancel_delayed_work_sync()	!
> 	[...]
> 	-> batadv_mcast_free()
> 
> Which ensures that no batadv_iv_ogm_schedule() is running before
> calling batadv_mcast_free().

That seems to be right.

> But I missed the path from batadv_update_min_mtu()... However,
> it should not race with batadv_mcast_free() either, which is
> called once the last hard-iface gets disabled:
> 
> batadv_hardif_disable_interface()
> 	-> batadv_purge_outstanding_packets()
> 		-> cancel_delayed_work_sync()	!
> 	-> dev_put(soft_iface)
> 		[ if last hard-iface, then soft-iface is out
> 		  of scope for any new batadv_update_mtu() and
> 		  gets freed: ]
> 		-> batadv_softif_free()
> 			-> batadv_mesh_free()
> 				-> batadv_mcast_free()
> 
> But with yet another path it is getting even more, rediculously
> complicated... Just proving that trying to keep a lock-less update
> for mla_list here is a bad, unmaintainable approach :).

Yes, and I am currently not trusting the reference/rcu code in batman-adv 
anymore. It seems to have a lot of backpointers (yes, loops are very bad for 
reference counting and the workarounds chosen in the code for that don't look 
very sane either). I actually don't want to check this mess to find out if 
there is a situation where were there can be an (incorrectly counted) 
reference somewhere back to it. Let us just assume that everything is fine 
here... better for everyones sanity.

> So I'm definitely in favor of having some spinlock to refer to for
> mcast.mla_list update, even if it isn't necessary for
> batadv_mcast_free(). But I don't see a race for the current
> mla_list updates either (otherwise I'd need a specific, more
> verbose example, I guess...).

The assert was more for having a common lock for all accesses to this list 
(ogm_schedule, mtu change event, hardif add/del, softif free/add). This lock 
already existed indirectly and protected the updates. The free functions 
should have the same protection (especially when changes are done to the code 
in the future which might call these functions in a different context). So 
right now the patch from Simon only works around the lockdep warning and 
prepares the code for the "future". So I don't think it is required for -
stable - especially when you are sure that there is no race between _free and 
_retract.

Kind regards,
	Sven
  

Patch

diff --git a/net/batman-adv/multicast.c b/net/batman-adv/multicast.c
index 8abf488..d984eee 100644
--- a/net/batman-adv/multicast.c
+++ b/net/batman-adv/multicast.c
@@ -801,7 +801,9 @@  void batadv_mcast_free(struct batadv_priv *bat_priv)
 	batadv_tvlv_container_unregister(bat_priv, BATADV_TVLV_MCAST, 1);
 	batadv_tvlv_handler_unregister(bat_priv, BATADV_TVLV_MCAST, 1);
 
+	spin_lock_bh(&bat_priv->tt.commit_lock);
 	batadv_mcast_mla_tt_retract(bat_priv, NULL);
+	spin_unlock_bh(&bat_priv->tt.commit_lock);
 }
 
 /**