LKML Archive on lore.kernel.org
help / color / mirror / Atom feed
* Re: [PATCH]Add notification for active Cell SPU tasks
[not found] <OF9A01B09B.B62F8342-ON8325723C.006A9343-8325723C.006B2236@us.ibm.com>
@ 2006-12-06 22:04 ` Maynard Johnson
2006-12-07 22:58 ` [Cbe-oss-dev] " Arnd Bergmann
0 siblings, 1 reply; 4+ messages in thread
From: Maynard Johnson @ 2006-12-06 22:04 UTC (permalink / raw)
To: Luke Browning, Arnd Bergmann
Cc: Luke Browning, maynardj,
linuxppc-dev-bounces+lukebrowning=us.ibm.com, linux-kernel,
linuxppc-dev, oprofile-list, cbe-oss-dev
Luke Browning wrote:
> linuxppc-dev-bounces+lukebrowning=us.ibm.com@ozlabs.org wrote on
> 12/04/2006 10:26:57:
>
> > linuxppc-dev-bounces+lukebrowning=us.ibm.com@ozlabs.org wrote on
> > 01/12/2006 06:01:15 PM:
> >
> > >
> > > Subject: Enable SPU switch notification to detect currently
> activeSPU tasks.
> > >
> > > From: Maynard Johnson <maynardj@us.ibm.com>
> > >
> > > This patch adds to the capability of spu_switch_event_register to
> notify the
> > > caller of currently active SPU tasks. It also exports
> > > spu_switch_event_register
> > > and spu_switch_event_unregister.
> > >
> > > Signed-off-by: Maynard Johnson <mpjohn@us.ibm.com>
> > >
> > >
> > > Index: linux-2.6.19-rc6-
> > > arnd1+patches/arch/powerpc/platforms/cell/spufs/sched.c
> > > ===================================================================
> > > --- linux-2.6.19-rc6-arnd1+patches.
> > > orig/arch/powerpc/platforms/cell/spufs/sched.c 2006-11-24 11:34:
> > > 44.884455680 -0600
> > > +++ linux-2.6.19-rc6-
> > > arnd1+patches/arch/powerpc/platforms/cell/spufs/sched.c 2006-12-01
> > > 13:57:21.864583264 -0600
> > > @@ -84,15 +84,37 @@
> > > ctx ? ctx->object_id : 0, spu);
> > > }
> > >
> > > +static void notify_spus_active(void)
> > > +{
> > > + int node;
> > > + for (node = 0; node < MAX_NUMNODES; node++) {
> > > + struct spu *spu;
> > > + mutex_lock(&spu_prio->active_mutex[node]);
> > > + list_for_each_entry(spu, &spu_prio->active_list[node], list) {
> > > + struct spu_context *ctx = spu->ctx;
> > > + blocking_notifier_call_chain(&spu_switch_notifier,
> > > + ctx ? ctx->object_id : 0, spu);
> > > + }
> > > + mutex_unlock(&spu_prio->active_mutex[node]);
> > > + }
> > > +
> > > +}
> > > +
> > > int spu_switch_event_register(struct notifier_block * n)
> > > {
> > > - return blocking_notifier_chain_register(&spu_switch_notifier, n);
> > > + int ret;
> > > + ret = blocking_notifier_chain_register(&spu_switch_notifier, n);
> > > + if (!ret)
> > > + notify_spus_active();
> > > + return ret;
> > > }
> > > +EXPORT_SYMBOL_GPL(spu_switch_event_register);
> > >
> > > int spu_switch_event_unregister(struct notifier_block * n)
> > > {
> > > return
> blocking_notifier_chain_unregister(&spu_switch_notifier, n);
> > > }
> > > +EXPORT_SYMBOL_GPL(spu_switch_event_unregister);
> > >
> > >
> > > static inline void bind_context(struct spu *spu, struct
> spu_context *ctx)
> >
> > Is this really the right strategy? First, it serializes all spu
> context
> > switching at the node level. Second, it performs 17 callouts for
>
I could be wrong, but I think if we moved the mutex_lock to be inside of
the list_for_each_entry loop, we could have a race condition. For
example, we obtain the next spu item from the spu_prio->active_mutex
list, then wait on the mutex which is being held for the purpose of
removing the very spu context we just obtained.
> > every context
> > switch. Can't oprofile internally derive the list of active spus
> from the
> > context switch callout.
>
Arnd would certainly know the answer to this off the top of his head,
but when I initially discussed the idea for this patch with him
(probably a couple months ago or so), he didn't suggest a better
alternative. Perhaps there is a way to do this with current SPUFS
code. Arnd, any comments on this?
> >
> > Also, the notify_spus_active() callout is dependent on the return
> code of
> > spu_switch_notify(). Should notification be hierarchical? If I
> > only register
> > for the second one, should my notification be dependent on the
> return code
> > of some non-related subsystem's handler.
>
I'm not exactly sure what you're saying here. Are you suggesting that a
user may only be interested in acitve SPU notification and, therefore,
shouldn't have to be depenent on the "standard" notification
registration succeeding? There may be a case for adding a new
registration function, I suppose; although, I'm not aware of any other
users of the SPUFS notification mechanism besides OProfile and PDT, and
we need notification of both active and future SPU tasks. But I would
not object to a new function.
> >
> > Does blocking_callchain_notifier internally check for the presence
> > of registered
> > handlers before it takes locks ...? We should ensure that there is
> > minimal overhead
> > when there are no registered handlers.
>
I won't pretend to be expert enough to critique the performance of that
code.
> >
> > Regards,
> > Luke___________________
>
> Any comments to my questions above. Seems like oprofile / pdt could
> derive the
> list of active spus from a single context switch callout. This patch
> will have
> a large impact on the performance of the system.
>
For OProfile, the registration is only done at the time when a user
starts the profiler to collect performance data, typically focusing on a
single application, so I don't see this as an impact on normal
production operations. Since you must have root authority to run
OProfile, it cannot be invoked by just any user for nefarious purposes.
-Maynard
>
> Luke
>
>------------------------------------------------------------------------
>
>_______________________________________________
>Linuxppc-dev mailing list
>Linuxppc-dev@ozlabs.org
>https://ozlabs.org/mailman/listinfo/linuxppc-dev
>
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [Cbe-oss-dev] [PATCH]Add notification for active Cell SPU tasks
2006-12-06 22:04 ` [PATCH]Add notification for active Cell SPU tasks Maynard Johnson
@ 2006-12-07 22:58 ` Arnd Bergmann
2006-12-08 15:04 ` Maynard Johnson
0 siblings, 1 reply; 4+ messages in thread
From: Arnd Bergmann @ 2006-12-07 22:58 UTC (permalink / raw)
To: cbe-oss-dev
Cc: Maynard Johnson, Luke Browning, maynardj,
linuxppc-dev-bounces+lukebrowning=us.ibm.com, Luke Browning,
linux-kernel, linuxppc-dev, oprofile-list
On Wednesday 06 December 2006 23:04, Maynard Johnson wrote:
> text(struct spu *spu, struct
> > spu_context *ctx)
> > >
> > > Is this really the right strategy? First, it serializes all spu
> > context
> > > switching at the node level. Second, it performs 17 callouts for
> >
> I could be wrong, but I think if we moved the mutex_lock to be inside of
> the list_for_each_entry loop, we could have a race condition. For
> example, we obtain the next spu item from the spu_prio->active_mutex
> list, then wait on the mutex which is being held for the purpose of
> removing the very spu context we just obtained.
>
> > > every context
> > > switch. Can't oprofile internally derive the list of active spus
> > from the
> > > context switch callout.
> >
> Arnd would certainly know the answer to this off the top of his head,
> but when I initially discussed the idea for this patch with him
> (probably a couple months ago or so), he didn't suggest a better
> alternative. Perhaps there is a way to do this with current SPUFS
> code. Arnd, any comments on this?
No code should ever need to look at other SPUs when performing an
operation on a given SPU, so we don't need to hold a global lock
during normal operation.
We have two cases that need to be handled:
- on each context unload and load (both for a full switch operation),
call to the profiling code with a pointer to the current context
and spu (context is NULL when unloading).
If the new context is not know yet, scan its overlay table (expensive)
and store information about it in an oprofile private object. Otherwise
just point to the currently active object, this should be really cheap.
- When enabling oprofile initially, scan all contexts that are currently
running on one of the SPUs. This is also expensive, but should happen
before the measurement starts so it does not impact the resulting data.
> > >
> > > Also, the notify_spus_active() callout is dependent on the return
> > code of
> > > spu_switch_notify(). Should notification be hierarchical? If I
> > > only register
> > > for the second one, should my notification be dependent on the
> > return code
> > > of some non-related subsystem's handler.
> >
> I'm not exactly sure what you're saying here. Are you suggesting that a
> user may only be interested in acitve SPU notification and, therefore,
> shouldn't have to be depenent on the "standard" notification
> registration succeeding? There may be a case for adding a new
> registration function, I suppose; although, I'm not aware of any other
> users of the SPUFS notification mechanism besides OProfile and PDT, and
> we need notification of both active and future SPU tasks. But I would
> not object to a new function.
>
I think what Luke was trying to get to is that notify_spus_active() should
not call blocking_notifier_call_chain(), since it will notify other users
as well as the newly registered one. Instead, it can simply call the
notifier function directly.
Arnd <><
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [Cbe-oss-dev] [PATCH]Add notification for active Cell SPU tasks
2006-12-07 22:58 ` [Cbe-oss-dev] " Arnd Bergmann
@ 2006-12-08 15:04 ` Maynard Johnson
0 siblings, 0 replies; 4+ messages in thread
From: Maynard Johnson @ 2006-12-08 15:04 UTC (permalink / raw)
To: Arnd Bergmann
Cc: cbe-oss-dev, maynardj, Luke Browning,
linuxppc-dev-bounces+lukebrowning=us.ibm.com, Luke Browning,
linux-kernel, linuxppc-dev, oprofile-list
Arnd Bergmann wrote:
>On Wednesday 06 December 2006 23:04, Maynard Johnson wrote:
>
>
>>text(struct spu *spu, struct
>>
>>
>>>spu_context *ctx)
>>>
>>>
>>>>Is this really the right strategy? First, it serializes all spu
>>>>
>>>>
>>>context
>>>
>>>
>>>>switching at the node level. Second, it performs 17 callouts for
>>>>
>>>>
>>I could be wrong, but I think if we moved the mutex_lock to be inside of
>>the list_for_each_entry loop, we could have a race condition. For
>>example, we obtain the next spu item from the spu_prio->active_mutex
>>list, then wait on the mutex which is being held for the purpose of
>>removing the very spu context we just obtained.
>>
>>
>>
>>>>every context
>>>>switch. Can't oprofile internally derive the list of active spus
>>>>
>>>>
>>>from the
>>>
>>>
>>>>context switch callout.
>>>>
>>>>
>>Arnd would certainly know the answer to this off the top of his head,
>>but when I initially discussed the idea for this patch with him
>>(probably a couple months ago or so), he didn't suggest a better
>>alternative. Perhaps there is a way to do this with current SPUFS
>>code. Arnd, any comments on this?
>>
>>
>
>
>
>No code should ever need to look at other SPUs when performing an
>operation on a given SPU, so we don't need to hold a global lock
>during normal operation.
>
>We have two cases that need to be handled:
>
>- on each context unload and load (both for a full switch operation),
> call to the profiling code with a pointer to the current context
> and spu (context is NULL when unloading).
>
> If the new context is not know yet, scan its overlay table (expensive)
> and store information about it in an oprofile private object. Otherwise
> just point to the currently active object, this should be really cheap.
>
>- When enabling oprofile initially, scan all contexts that are currently
> running on one of the SPUs. This is also expensive, but should happen
> before the measurement starts so it does not impact the resulting data.
>
>
>
>>>>Also, the notify_spus_active() callout is dependent on the return
>>>>
>>>>
>>>code of
>>>
>>>
>>>>spu_switch_notify(). Should notification be hierarchical? If I
>>>>only register
>>>>for the second one, should my notification be dependent on the
>>>>
>>>>
>>>return code
>>>
>>>
>>>>of some non-related subsystem's handler.
>>>>
>>>>
>>I'm not exactly sure what you're saying here. Are you suggesting that a
>>user may only be interested in acitve SPU notification and, therefore,
>>shouldn't have to be depenent on the "standard" notification
>>registration succeeding? There may be a case for adding a new
>>registration function, I suppose; although, I'm not aware of any other
>>users of the SPUFS notification mechanism besides OProfile and PDT, and
>>we need notification of both active and future SPU tasks. But I would
>>not object to a new function.
>>
>>
>>
>I think what Luke was trying to get to is that notify_spus_active() should
>not call blocking_notifier_call_chain(), since it will notify other users
>as well as the newly registered one. Instead, it can simply call the
>notifier function directly.
>
>
Ah, yes. Thanks to both of you for pointing that out. I'll fix that
and re-post.
-Maynard
> Arnd <><
>
>-------------------------------------------------------------------------
>Take Surveys. Earn Cash. Influence the Future of IT
>Join SourceForge.net's Techsay panel and you'll get the chance to share your
>opinions on IT & business topics through brief surveys - and earn cash
>http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
>_______________________________________________
>oprofile-list mailing list
>oprofile-list@lists.sourceforge.net
>https://lists.sourceforge.net/lists/listinfo/oprofile-list
>
>
^ permalink raw reply [flat|nested] 4+ messages in thread
* [PATCH]Add notification for active Cell SPU tasks
@ 2006-12-01 20:01 Maynard Johnson
0 siblings, 0 replies; 4+ messages in thread
From: Maynard Johnson @ 2006-12-01 20:01 UTC (permalink / raw)
To: cbe-oss-dev, linuxppc-dev, oprofile-list, linux-kernel
[-- Attachment #1: Type: text/plain, Size: 1 bytes --]
[-- Attachment #2: spu-notifier.patch --]
[-- Type: text/x-diff, Size: 1828 bytes --]
Subject: Enable SPU switch notification to detect currently active SPU tasks.
From: Maynard Johnson <maynardj@us.ibm.com>
This patch adds to the capability of spu_switch_event_register to notify the
caller of currently active SPU tasks. It also exports spu_switch_event_register
and spu_switch_event_unregister.
Signed-off-by: Maynard Johnson <mpjohn@us.ibm.com>
Index: linux-2.6.19-rc6-arnd1+patches/arch/powerpc/platforms/cell/spufs/sched.c
===================================================================
--- linux-2.6.19-rc6-arnd1+patches.orig/arch/powerpc/platforms/cell/spufs/sched.c 2006-11-24 11:34:44.884455680 -0600
+++ linux-2.6.19-rc6-arnd1+patches/arch/powerpc/platforms/cell/spufs/sched.c 2006-12-01 13:57:21.864583264 -0600
@@ -84,15 +84,37 @@
ctx ? ctx->object_id : 0, spu);
}
+static void notify_spus_active(void)
+{
+ int node;
+ for (node = 0; node < MAX_NUMNODES; node++) {
+ struct spu *spu;
+ mutex_lock(&spu_prio->active_mutex[node]);
+ list_for_each_entry(spu, &spu_prio->active_list[node], list) {
+ struct spu_context *ctx = spu->ctx;
+ blocking_notifier_call_chain(&spu_switch_notifier,
+ ctx ? ctx->object_id : 0, spu);
+ }
+ mutex_unlock(&spu_prio->active_mutex[node]);
+ }
+
+}
+
int spu_switch_event_register(struct notifier_block * n)
{
- return blocking_notifier_chain_register(&spu_switch_notifier, n);
+ int ret;
+ ret = blocking_notifier_chain_register(&spu_switch_notifier, n);
+ if (!ret)
+ notify_spus_active();
+ return ret;
}
+EXPORT_SYMBOL_GPL(spu_switch_event_register);
int spu_switch_event_unregister(struct notifier_block * n)
{
return blocking_notifier_chain_unregister(&spu_switch_notifier, n);
}
+EXPORT_SYMBOL_GPL(spu_switch_event_unregister);
static inline void bind_context(struct spu *spu, struct spu_context *ctx)
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2006-12-08 15:04 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
[not found] <OF9A01B09B.B62F8342-ON8325723C.006A9343-8325723C.006B2236@us.ibm.com>
2006-12-06 22:04 ` [PATCH]Add notification for active Cell SPU tasks Maynard Johnson
2006-12-07 22:58 ` [Cbe-oss-dev] " Arnd Bergmann
2006-12-08 15:04 ` Maynard Johnson
2006-12-01 20:01 Maynard Johnson
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).