Conference PaperPDF Available

Making Offensive Play Predictable -Using a Graph Convolutional Network to Understand Defensive Performance in Soccer

Authors:
1
Making'Offensive'Play'Predictable'-'Using'a'Graph'
Convolutional'Network'to'Understand'Defensive'
Performance'in'Soccer'
'
Michael(Stöckl,(Thomas(Seidl,(Daniel(Marley(&(Paul(Power(|(Stats(Perform(
(
!
1.#Introduction#
1.1#Measuring#defensive#quality#in#soccer#
The!art!of!good!defending!is!to!prevent!something!from!happening!before!it!has!even!happened.!Virgil!Van!
Dijk!is!considered!one!of!the!best!defenders!in!world!soccer!as!he!has!the!ability!to!prevent!a!pass!being!made!
to!an! open!attacker!to!shoot!by!forcing! the! ball!carrier!to!pass!somewhere!else!less!dangerous.!!However,!
while!we!know!this!is!great!defending,!in!today’s!stats,!Van!Dijk!would!not!receive!any!acknowledgement.!!A!
defender’s!contribution!is!simply!measured!by!the!number!of!tackles!or!interceptions!they!make.!But!what!
if!we!were!able!to!measure!actions!that!have!been!prevented!before!they!were!made?!!
!!!
The! aim! of! a! defense! and! a! defender! is! to! make! offensive! play! predictable.! For! example,! Jürgen!Klopp’s!
Liverpool,!press!the!opposition!with!the!aim!of!forcing!them!to!give!the!ball!away!in!specific!areas!of!the!pitch!
by!limiting!the!number!of!passing!options! available!in! dangerous! areas.! !If! the! art! of! good!defending!is!to!
make!play!predictable,!then!it!should!be!measurable.!Given!enough!data,!we!should!be!able!to!predict!where!
a!player!will!pass!the!ball,!the!likelihood!of!that!pass!being!completed!and!whether!this!pass!will!result!in!a!
scoring!opportunity.!It!therefore!stands!that!we!should!be!able!to!measure!if!a!defender!forces!an!attacker!to!
change!their!mind!or!to!prevent!an!attacker!from!even!becoming!an!option.!!!
!
Figure 1 shows a situation from a match between Liverpool vs Bayern Munich in the 2018/19!UEFA!Champions!
League!that!leads!to!Mané!(red!10)!scoring.!!Our!model!identifies!that!Milner!(red!7)!is!the!primary!target!for!
Van!Dijk!(red!4)!in! the! first! instance.!However,!due!to!the!combination!of!Gnabry!(blue!22)! closing! down!
Milner,!Lewandowski!(blue!9)!closing!down!Van! Dijk! and! Mané! making! an!active'run,!behind!the!defence,!
Mané!becomes!both!the!most!likely!receiver!and!a!high!threat!for!scoring.!!This!demonstrates!our!ability!to!
model!how!players!decision!making!is!influenced!and!how!a!situation!can!move!from!low!threat!to!high!threat!
by!the!off-ball!actions!of!attackers!and!defenders.![LINK!TO!VIDEO].!
!
In!this!paper!we!present!a!novel!Graph!Convolutional!Neural!Network!(GNN)!which!is!able!to!deal!with!highly!
unstructured!and!variable!tracking!data!to!make!predictions!in!real!time.!!This!allows!us!to!accurately!model!
defensive!behaviour!and!its!effect!on!attacking!behaviour,!i.e.,!preventing'actions'before'they!have'occurred.!!!
"
To!do!this!we!trained!the!following!models:!
- !"#$#%&#'(!Predicts!the!likelihood!of!every!player!becoming!the!pass!receiver!at!any!moment!within!
a!player!possession.!!!
- !)*'#+,(-Predicts!the!probability!of!a!shot!occurring!in!the!next!10!seconds!if!a!pass!was!played!to!
an!attacker.!!!
- !.+//:!Predicts!how! likely! a!pass!would!be!completed!to!each!attacker! off! the! ball! at!any!moment!
within!a!player!possession.!!
!
!
2
and!introduce!new!defensive!concepts:!
!
- .0+1#'- 2&+%0+3%0%,1:! Using! the! outputs! from! xReceiver! and! xPass! we! infer! how! available! every!
attacker!is!off!the!ball!at!each!frame.!
- 4#5#6/%&#- 789+$,:! We! are! able! to! detect! high! level! defensive! concepts! such! as! ball! and! man!
orientated!defending,!defensive!position!play!and!off!ball!runs.!!
- 4%/':9,%;6- <+9/:! Global! visual! representations! of! defending! teams’! ability! to! disrupt! the!
oppositions!attacking!strategy.!
-
=%>:'#-?(-We!train!three!models!(!.+//@-!"#$#%&#'-A-!)*'#+,)!to!better!understand!defensive!and!offensive!
off!the!ball!behaviour,!such!as!8+6B;'%#6,+,#C-C#5#6C%6>,!3+00B;'%#6,+,#C-C#5#6C%6>-and!+$,%&#-;55-3+00-
':6/.!!This!lets!us!better!understand!how!Sadio!Mané!scored!a!1-0!lead!vs!Bayern!Munich!during!Liverpool’s!
way!to!the!UCL!final!in!2017/18.!LINK!TO!VIDEO!
"
"
"
"
"
"
"
"
"
"
"
"
# #
3
Related#Work#
#
1.2.1 Dealing#with#unstructured#data#
Tracking!data!is!highly!unstructured!and!can!be!difficult!to!model!due!to!most!machine!learning!techniques!
requiring!tabular!datasets!where!features!are!inserted!in!a!specific!order.!!To!solve!this!ordering!issue!Lucey!
et!al.![1]!presented!the!concept!of!aligning!players!to!a!formation!template![1,!2].!!However,!this!method!has!
several!limitations.!Firstly,!in!soccer,!teams!use!different!formations!so!a!player!at!role!10!for!a!team!using!
433!would!be!very!different!to!a!team!playing!352.!!Therefore,!it!is!difficult!to!compare!predictions!between!
these!players.!!In!addition,!this!method!is!reliant!upon!teams!having!the!same!number!of!players!on!the!pitch!
(11!per!team)!meaning! different! models!have!to!be!learned!when!players!have! been! sent! off!for!example.!!
With!regards!to!real!time!inference!a!further!limitation!is!the!speed!at!which!players!need!to!be!aligned!before!
calculating!features!for!inference.!!"
!
Fernandez![3]!and!Brefeld![4]! use! Convolutional! Neutral!Networks!(CNNs)!on! an! image! representation!of!
tracking! data! to!circumvent! the! ordering/alignment! issue! while! predicting! probabilistic! pitch! control!
surfaces.!!Converting!tracking!data!directly!to!images!is!suboptimal!as!one!gives!up!a!very!low-dimensional!
data! set! and! converts! it! into! a! high-dimensional! sparse! representation.!!Tracking! data! has! an! irregular!
structure!due!to!missing!players!and!a!lack!of!clear!scheme!to!order!players!in!a!sequence!or!frame![19].!Both!
techniques! are! also! time! consuming! and!could! cause! timing! issues! for! feature! generation! in! real-time!
applications.!!Instead!of!using!an!image-based!representation!we!used!Graph!Neural!Networks!(GNNs)!which!
1)!neglects!the!need!for!ordering!features,!2)!can!cope!with!varying!number!of!players!on!the!pitch!and!3)!
learns! local! and! higher! scale! features! directly! from! the! tracking! data.! Horton! applied!a! set-learning!
framework!to!model!passing!in!football!which!is!similar!to!a!simple!graph!(no!edges)![18].!
"
1.2.2 Evaluating#and#Predicting#Future#Actions#in#Sports#
Concepts!such!as!xThreat!and!xPass!are!not!new!with!previous!research!using!tracking!data!to!predict!the!
likelihood!of!a! pass! being!complete!or!a! goal! being! scored! after! a! specific! action![2,! 4,! 6].!! In! addition!to!
measuring!the!value!of!an!action,!the!concept!of!valuing!a!players’!off!ball!position,!has!also!been!investigated!
[2,! 3].! ! Spearman! [9]! also! developed! a! model! to! evaluate! off-ball! scoring! opportunities!in! soccer.! !These!
models!create!a!surface!area!combining!xThreat!and!xPass!values!to!understand!how!dangerous!a!team’s!or!
players!current!possession!is!and!also!what!space!they!control.!
!
Wei! [7,! 8]! modelled! the! probability! of! where! the! next! action! will! go! in! tennis! and! soccer.! Franks! [10]!
predicted!defensive! match! ups! and! measured! the! influence! of! defenders! on! the! offenses! shooting!
performance.!!Ghosting![11,!12]!hallucinated!where!a!team!of!defenders!will!move!to!next!based!on!where!
the!attackers!have! moved,!and! the!ball! is! moving! to.! !This! potentially! provides! useful! tools!to! assess! the!
defensive! strategy! of! teams!by! evaluating! the! difference! in! Expected! Goal! (xG)! or! Possession! Value!(PV)!
compared!to!a!global!baseline.!"
"
2.#Method#
2.1#Data#
To!train!and!validate!the!three!models!(xTransition,!xThreat,!and!xReceiver)!we!used!1,200!games!of!tracking!
data!from!multiple!seasons!of!Top!5!European!football!leagues!sampled!at!10Hz!per!second.!!Tracking!data!
consists!of!(x,! y)! positions!for!each!player!and!the! ball,! the!team!and!player!ids,!time,!half,! and! event!id!at!
each!frame.!!In!total,!the!dataset!consisted!of!one!million!passes!which!was!split!into!a!90/10!train!and!test!
set.!
4
!
For!learning!a!xThreat!model!only!frames!relating!to!the!moment!of!passing!events!were!considered.!For!the!
xTransition!and!xReceiver!models!we!included!tracking!data!from!not!only!the!individual! passes! but! also!
tracking!data!from!one!half-second!(5!frames)!and!one!second!(10!frames)!before!the!pass.!Including!these!
two!additional! moments! prior! to!each! pass! event! allowed!us!to! achieve! a!semantic!regularization! during!
training!preventing!the! model! to! overfit! to! the! pass!moment! where! players’!movements!already! indicate!
where!the!ball!will!be!played!to!some!degree.!!!
2.2#Graph#Convolutional#Network"
To!represent!the!tracking!data!in! a! well-defined! structure!that!avoids!ordering!issues,!we!used!a!graph.! A!
graph!G(V,!E,!U)!is!defined!by!nodes!V,!edges!E,!and!global!features!U.!In!our!representation,!as!shown!in!
Figure!2,!the!nodes!represent!the!player!and!ball!tracking!data,!and!the!edges!contain!information!about!the!
relationship!between!the!nodes.!No!global!features!were!included!in!this!approach.!The!edges!eij!are!directed!
and!connect!a!sending!node!vi!to!a!receiving'node!vj.!!
!
To!learn!the!relationship!between!the!graph!input!and!outputs,!we! used! a! GNN.!Specifically,!we!apply!the!
spatial!GNN!approach! that! includes!separate!operations,! known! as! blocks,!on! the! edges! and!nodes!of! the!
graph![13].!An!edge!block!is!defined!by!a!neural!network!that!takes!inputs!from!the!edge!features,!sending!
node!features,!receiving!node!features!and!outputs!a!new!edge!embedding.!!!
!
Similarly,!a!node!block!is!defined!by!a!neural!network!that!takes!inputs!from!the!node!features,!aggregated!
sending! edge! features,! aggregated! receiving! edge! features! and! outputs! a! new! node! embedding.! A!
permutation!invariant! function! is!required! to! aggregate! the!sending! and! receiving!edge! features,! e.g.,!the!
mean!or!sum!of!those!features.!We!used!a! similar! GNN! architecture!for!each!of!the!three!models:!an!edge!
block!followed! by! a!node! block.! Each!block! has! a! multilayer! perceptron!(MLP)!with!three! layers! and!the!
number!of!units!in!each!layer!varies!between!each!task.!!
!
Using!an!edge!block!in!the!GNN!allowed!the!network!to!learn!the!relationships!between!different!nodes!for!
each!task.!This!flexibility!is!not!present!in!standard!approaches!such!as!MLPs!or!convolutions,!where!the!
connections!between!inputs!are!defined.!
!
Each!model!outputs!a!prediction!for!each!player!from!the!final!node!block.!!The!prediction!on!each!node!is!
the!likelihood!of!a!player!receiving!the!next!pass!(xReceiver),!the!pass!will!be!completed!(xPass),!and!if!there!
will!be!a!shot!on!goal!within!the!next!10!seconds!(xThreat).!
!
Figure'2:""
Sketch" of" the" graph" representation" used" for" the"
tracking"data.""Individual"players"and"the"ball"are"
shown"as"nodes"in"the"graph"with"directed"edges"
connecting"them."""
"
Individual" players" and" the" ball" are" shown" as"
nodes"in"the"graph"connected"by"directed"edges."
"
Edges" are" weighted" to" allow" the" model" to" learn"
which"nodes"are"on"which"team."""
"
5
2.3#Features#
All!three!GNN!models! use!the!same!set!of! input!features.!Node!features! include!player!XY!position,!speed,!
acceleration,!angle!of!motion,!distance!and!angle!to!the!attacking!goal,!distance!to!the!ball!carrier,!difference!
in!the!angle!of!motion!to!the!ball!carrier,!and!a!flag!that!indicates!whether!the!player!is!the!ball!carrier.!The!
edge!features!include!a!flag!defining!the!relationship!between!the!two!nodes!(teammate!2,!or!opponent!1),!
the!distance!between!the!two!players,!and!the!difference!in!the!angle!of!motion.!
!
2.4#Training#
We!trained!a!unique!GNN!for!each!of!the!xTransition,!xThreat,!and!xReceiver!models
1
.!!Across!the!different!
tasks,!the!nodes! are! weighted! in! the!loss!calculation!to! stabilize! the! training.!The!nodes! representing! the!
defensive!players!were!masked!out!for!all!models.!During!the!training!of!the!xTransition!and!xThreat!model,!
the! only! players! considered! were! those! that! received! the! pass! or! were! the! intended! receivers.! For! the!
xReceiver!model!the!(intended)!receiver!of!a!pass!has!a!weight!of!1.0!and!all!other!teammates!of!the!ball!
carrier!have!a!weight!of!0.1!to!balance!the!signal!to!background.!
-
2.5#Model#Training#Results#
We!compared!the!trained!graph!models!xReceiver,!xPass!and!xThreat!against!respective!baseline!models!we!
trained!earlier.!The!baseline!models!were!MLPs!that!were!based!on!many!handcrafted!features!considering!
the! players’! motion! characteristics! (speed,! acceleration,! moving! direction),! relationships! between! the!
players!shown!by!differences!in!the!motion!features!and!we!also!considered!the!ball!information!(where!it!is!
played!with!which!speed)!as!seen!in![11,!12].!!!
!
The!loss!and!accuracy!of!all!three!GNN!models!were!better!than!or!the!same!as!the!metrics!of!the!respective!
baseline!model.!Whereas!the!accuracy!was!calculated!the!same!for!all!models,!the!logloss!is!not!comparable!
for!the!xReceiver!models.!As!described!above,!the!logloss!of!the!GNN!xReceiver!contains!all!teammates!of!the!
passer,!even!though!most!of!them!with! small! weights*.! However,! the! logloss!for!the!baseline!xReceiver!is!
calculated!only!considering!the!player!a!pass!was!played!to.!In!total,!the!GNN!models!are!better!or!at!least!of!
the!same!quality!as!the!baseline!models!although!much!fewer!handcrafted!features!were!used!since!the!graph!
was! able! to! learn! the! spatial! relationships! between! the! players! on! the! pitch.!!Interestingly! the! baseline!
xReceiver!model!had!the!speed!of!pass,!angle!of!pass!and!distance!to!receiver!calculated!for!the!pass!moment.!
However,! these! features! were! not! included!in! the! GNN.! ! This! indicates! that! the! GNN! is! able! to! learn! the!
complex!dynamic!characteristics!of!football!based!on!the!player!motion!features!at!the!node!level!combined!
with!the!inter-player!motion!dynamics!and!team!identity!at!the!edge.!!This!is!what!allows!us!to!use!a!simpler!
feature!representation!allowing!faster!end!to!end!prediction!(0.05s).!!See!Appendix!A1!for!details.!!
'
Table!1:'Training'metrics'of'the'three'GNN'models'
'
logloss'
baseline"xPass""
0.29"
GNN"xPass""
0.28"
baseline"xReceiver""
0.70"
GNN"xReceiver""
0.07*"
baseline"xThreat"
0.16"
GNN"xThreat"
0.16"
1
!We!use!a!custom,!pure-TensorFlow!package!based!on!the!DeepMind!graph_nets!library!
[https://github.com/deepmind/graph_nets]!to!train!our!GNNs.!
6
#
#
#
#
=%>:'#-D(-Example!of!how!we!are!able!to!capture!emergent!behaviour.!2BE!show!specific!moments!within!a!
possession!of!Bayern!Munich!(blue!team).!=!shows!frame!level!estimates!for!xReceiver!(low!values:!blue,!high!
values:! red)! for! a! subset! of! offensive! players! throughout! the! possession.! Players! are! identified! by! jersey!
number.! Grey! ellipsoids! show! collective! man! orientated! defending! whereas! orange! ellipsoids! show! ball!
orientated!defending!from!Liverpool!(red!team).!!Green!ellipsoids!show!active!runs!from!the!blue!team.!F7GH-
)I-J74EI-----
7
#
3.#Measuring#the#Unmeasurable#
#
We! introduce! a! new! defensive! toolbox,! Defensive' Impact,!which! analytically! determines! how! much! the!
defending!team!disrupts!the!oppositions!play.!!We!could!measure!this!by!looking!at!where!teams!intercept!
the! ball,! however! as! discussed! in! Section! 1! defending! is! about! preventing' what' hasn’t' happened' yet.! We!
demonstrate! that! the! xReceiver! model! not! only! accurately! predicts! who! will! receive! the!ball! next! at! the!
moment!of!a!pass,!but!when!applied!at!the!frame!level,!captures!human!decision!making.!!We!find!the!model!
can!predict!who!will!receive!the!ball!in!advance!of!a!pass!being!made!and!also!(more!impressively)!when!a!
player!changes!his!mind!and!more!importantly,!why.!Our!xThreat!and!xPass!models!allow!us!to!value!not!just!
what!did!happen!but!what!could!have!happened!or!more!accurately!what!was!prevented!(Figure!3!link!to!
video).!
!
In!the!following!section!we!will!demonstrate!how!we!created!the!features!that!power!Defensive!Impact!and!
the!new!insights!that!are!now!possible!to!be!generated!using!an!example!game!between!Lazio!vs!Juventus!
from!2018/19!Serie!A!season.!
#
3.1#Disruption#Maps#
While!we!are!able!to!create!very!granular!insights!with!Defensive!Impact,!the!ability!to!find!summary!insights!
is!equally!important.!!To!provide!compressed!representations!of!our!insights!we!introduce!Disruption'Maps.''
A!Disruption!Map!is!a!weighted!2d!distribution!that!shows!where!a!team,!positively!or!negatively,!disrupted!
the!oppositions!off!ball!options.!!To!calculate!a!Disruption!Map,!we!first!generate!a!‘spatial'identity’!(Figure!
4! top! left)! for! each! team! for! their! xReceiver,! xThreat! and! xPass! model! output.! ! This! acts! as! a! global!
representation!of!a! team!(see! Appendix!A2!for!outputs!for!all!teams!in!2018/19!Serie! A! season).! We!then!
calculate!the!same!surface!at!a!game!level!(Figure!4!top!middle)!and!subtract!the!two!surfaces!to!create!the!
Disruption!Map!(Figure!4!top!right).!!This!final!image!reveals!where!the!opposition!disrupts!(or!not!in!some!
cases)!the! oppositions! normal!strategy/flow
2
.!! We!can!now!use!these!maps!to!determine!which! team! had!
the!biggest!impact!on!their!oppositions!attacking!style!and!efficiency.!!We!are!also!able!to!break!this!down!
at!a!player!level!to!see!who!was!target!both!on!and!off!the!ball.--
-
3.2#Lazio’s#“Stellungsspiel”#-#Defensive#Position#Play""
Despite! Lazio! being! beaten! 1-2,! Juventus’! Giorgio! Chiellini! stated! post! game! "It! was! the! worst! Juventus!
performance!of!the!season!for!the!first!60!minutes”![14].!We!used!our!Disruption!Maps!to!understand!what!
Lazio! did! to! affect! Juventus! so! much! and! which! players! they! targeted! Figure! 4! bottom! shows! Juventus’!
Disruption!Maps!for!the!game.!We!can!clearly!see!Lazio!considerably!decreased!the!xThreat!from!their!left!
side!of!the! penalty! box! (Ronaldo’s!side),!with!a! 10%! decrease! in!Juventus’!xThreat!compared! to! all! other!
Juventus!games!in!that!season.!!In!addition,!we!can!also!see!how!Lazio!increased!the!risk!of!completing!a!pass!
to!any!option!in!the!wide!channels!between!the!touchline!and!six-yard!box!and!in!the!“second!penalty!area”!
outside!the!penalty! area!by!20%.!!These! are! the!most!dangerous!zones!where! shots!and!goals!are!created!
and!the!spaces!that!Juventus’!three!primary!attackers!operate.!!With!regards!to!where!players!were!likely!to!
be!a!target!for!a!pass,!we!can!see!that!xReceiver!was!higher!than!average!around!Lazio’s!penalty!box!however,!
as!discussed,! these! areas!had!a!significantly!higher! risk!of!competition!and!more!importantly! significantly!
lower!probability!of!leading!to!goal!even!if!they!were!completed.!
2
!Disruption!Maps!estimate!tactical!deviation!from!an!average!style,!where!differences!can!be!attributed!to!
opposition!and/or!changes!of!your!own!team’s!tactic!for!a!specific!game.!To!differentiate!between!the!two!
is!out!of!the!scope!of!this!paper.!
8
!
!
!
=%>:'#-KL- - Top!row!shows! how! Disruption!Maps! are! composed;!they! are! the!difference! between-+- Team!
Identity!and!a!Game!Identity;!Bottom!row!shows!Juventus’!Disruption!Maps!for!xReceiver,!xPass,!and!xThreat!
!
To!re-emphasise!how!poor!Juventus’!performance!was!viewed,!the!German!football!magazine!Kicker!stated!
in!the!first!half,!“Cristiano! Ronaldo! was! out! of! the! game,!and!Dybala!also!failed!to!give! momentum! to! the!
Bianconeri's!offense.”! [14].! !Our! player! Disruption! Maps!back!this! up! (Figure!5).!Ronaldo’s! probability! of!
receiving!a!pass!was!below!average!across!the!pitch!in!first!half!especially!in!right!channels.!This!marginally!
improved!in!the!second!half!but!only!in!the!deep!right!channel.!!His!threat!didn’t!exist!in!the!first!half!(Figure!
5!lower!left)!however,!he!took!up!dangerous!positions!on!the!left!channel.!But,!as!shown!in!his!second!half!
xReceiver!Disruption!Map!(Figure!5!upper!right)!he!was!never!viewed!as!a!good!option!to!pass!to.!
!
Dybala!took!up!significantly!more!dangerous!positions!on!the!left!of!the!penalty!area!and!around!the!penalty!
spot.!!The!risk!of!completing!the!pass!was!significantly!lower!as!well!indicating!he!took!both!threatening!and!
low! risk! positions! in! attacking! areas.! ! However,! he! was! rarely! viewed! as! a! realistic! passing! option! (low!
xReceiver)!with!him!only!being!above!average!on!the!left!flank!and!the!middle!of!the!pitch.!!This!is!the!critical!
insight!we!are!now!able!to!surface!compared!to!other!methods.!!We!are!not!just!able!to!measure!if!a!player!
is!in!a!good!position!(high!threat/low!risk),!we!can!measure!if!they!are!a!realistic!option!to!receive!the!ball.!!
To!understand!why!Dybala!was!not!a!viable!option!despite!his!good!positioning!we!need!to!go!a!deeper!level!
of!analysis!which!is!out!of!scope!for!this!paper.!
9
!
!
=%>:'#-M(-Player!xReceiver!and!xThreat!Disruption!Maps!by!half:!Left!Ronaldo,!Right!Dybala.!Blue!areas!are!
below!average!values!whereas!red!indicates!above!average!regions.-
-
4.#Going#Deeper#–#Measuring#Decision#Making!
#
A!primary! defensive! tactic!in!soccer!is!to!target! key! players!in!possession.!!These!may! be!players!who!are!
important!to!the!teams!build!up!or!conversely!players!who!are!more!likely!to!give!the!ball!away!and!cause!
transition!moments.!!Targeting!can!happen!in!three!ways,!ball'orientated'where!defenders! will!specifically!
target!the!player!in!possession!through!pressure;!man!orientated,!where!the!ball!carrier!is!not!targeted!but!
their!passing!options!are;!or!a!combination!of!both!with!the!aim!of!either!causing!a!transition!or!for!the!ball!
to!be!played!to!a!less!dangerous!area![16]!(Figure!3).!
!
These!high-level!complex!coaching!concepts! are!incredibly!difficult!to!capture!and!rely! on!domain!experts!
watching!hours!of!video!to!find!and!analyze.!!However,!by!using!the!output!from!the!xReceiver!model,!we!can!
create!a!detector! to! find!these!moments!by! simply! finding!when! the! primary! target!changes!for! the! ball!
carrier.!!We!define!the!primary!target!as!the!attacker!with!the!highest!xReceiver!value!at!each!frame.!!If!there!
is!a!change!in!the!primary!target,!we!make!an!assumption!that!a!proactive!action!has!occurred.!This!could!be!
a!run!from!a!supporting!attacker,!the!player!in!possession!moving!with!the!ball!or!a!defender(s)!actions.!
!
An!obvious! next! step! could!be! to! train! a!model! to! predict!these!situations.! ! However,!these!labels! do! not!
currently!exist!and!to!ask!a!set!of!domain!experts!to!annotate!thousands!of!examples!would!be!highly!time!
consuming!and!expensive.!!Instead,!we!used!a!programmatic!labelling!functions!approach![17]!and!asked!a!
domain! expert! to! identify! simple! functions! that! capture! the! expected! behaviour! of! players! for! different!
defensive!contexts!(see!Appendix!A3!for!a!sample).!!Based!on!observing!our!initial!detected!moments!with!a!
domain!expert,!we!defined!three!defensive!and!two!attacking!situations!(Table!3).!
!
!
!
10
!
Table!3:'Defensive'off'ball'contexts'and'their'definitions'as'defined'by'the'domain'expert.'
!
#
#
4.1#Ronaldo:#Null#and#Void#
We!can! see! from! the! Disruption! Maps!that! in! the! first!half! Lazio! were! able!to! nullify! Ronaldo! as!a! viable!
passing!option.!However,!how!did!they!do!this?!!Using! our!Defensive!Impact!toolbox,!we!can!now!assess!if!
Ronaldo!was!targeted!when!in!possession!(ball!orientated!defending)!and!out!of!possession!(man!orientated!
defending).!!In!addition,!we!can!assess!if!Ronaldo!made!any!active!runs!to!become!more!available!for!a!team!
mate.!!!
-
4.1.1 Targeting#the#Supply#Line#
To!understand!why!Ronaldo!went!missing!in!the!first!half,!we!identified!the!primary!players!who!pass!the!
ball!to!Ronaldo!from!the!last!5!games!(Table!5).!We!chose!the!last!5!games!as!this!is!how!opposition!scouts!
would!do!it.!!!We!measured!how!often!Ronaldo!was!identified!as!a!primary!target!for!these!players!(Table!6).!!
We!clearly! see! that! the!relationship!between!Matuidi! and! Ronaldo!is! heavily!targeted,!with! Ronaldo! only!
being!considered!an!option!for!Matuidi!once!during!the!Lazio!match.!We!also!see!the!difference!in!Ronaldo’s!
performance!between!the!first!and!second!half,!with!Ronaldo!only!being!identified!5!times!in!the!first!and!8!
times!in!the!second.!!This!further!supports!the!observations!in!Kicker.!!
!
4.1.2 Targeting#Ronaldo#in#Possession#
Ronaldo! was! targeted! 8! times! in! the! game! when! in! possession! with! Lazio! applying! both! ball! and! man!
orientated!defending.!!Figure!6!(right)!shows!a!primary!example!of!how!Lazio!closed!down!Ronaldo’s!options!
by!applying!pressure!to!Ronaldo!as! he! carried! the! ball!and!also!applying! pressure! to! his! primary! targets;!
Matuidi!(red!14)!and!Sandro!(red!12).!!We!can!see!how!Sandro!made!an!active!run!to!become!the!primary!
target!and!how!Correa!(blue!11)!attempted!to!apply!man!orientated!pressure.!!However,!the!defensive!run!
of!Luis!Alberto!(blue!10)!to!press!Ronaldo!was!responsible!for!the!decrease!in!Sandro’s!xReceiver!value.!!!At!
the!same!time! Matuidi! (red! 14)!became!the!most!likely!receiver!(xR!0.98),!however,! due! to! the! combined!
pressure!of!Milinković-Savić! (blue!21)!and!Parolo!(blue! 16)!his!xPass!was!only!0.66,! meaning! there!was!a!
high!chance!of!a!transition.!
!
!
!
!
!
Situations'
Definition'
Ball"orientated"defending"
Defender"moves"closer"to"the"ball"carrier"at"high"speed"to"increase"the"chance"of"the"
ball"being"given"away"or"pass"to"less"dangerous"area.'
Man"orientated"defending"
Defender"moves"closer"to"the"primary"target"to"reduce"the"probability"of"them"being"a"
receiver.'
Ball"and"man"orientated"
defending"
Combination"of"the"previous"two.'
Active"off"Ball"Runs"
An"attacker"moves"at"high"speed"to"increase"their"probability"of"being"a"receiver.'
11
!
Table!5.'Top'3'passers'who'started'the'game'from'the'previous'5'matches'
'
'
-
-
-
'
'
'
Table!6.'Ronaldo’s'availability'for'his'three'primary'suppliers'
-
-
!
!
!
-
-
Table!7.'The'availability'of'Ronaldo’s'three'primary'suppliers'when'he'is'in'possession'
-
!
Option'
Ronaldo'In'Possession'!
First'Half!
Ronaldo'In'Possession'!
Second'Half'
Paulo"Dybala"
4"
4"
Blaise"Matuidi"
2"
1"
Alex"Sandro"
3"
6"
!
We!have!demonstrated!how!effective!Lazio’s!defensive!play!was!in!nullifying!Ronaldo!and!how!they!did!it.!
The!other!aspect! we! can!examine!is! if! Ronaldo!actively!tried! to! make! himself!available.!! Ronaldo! made! 7!
active!runs!(Figure!6!Right!F%6N-,;-J%C#;)!however!only!3!were!attacking!forward!runs.!!If!we!compare!this!
to!Lazio’s!forwards,!Immobile!and!Luis!Alberto!(Figure!7!Left),!we!see!they!were!more!dangerous!with!their!
runs.!!Figure!6!(Left)!shows!an!example!of!Ronaldo!making!an!active!run!into!the!penalty!area!to!become!a!
primary! target! for! Douglas! Costa.! ! When! Costa! received! the! ball! Ronaldo! only! had! an! xReceiver! of! 0.22.!!
However,!as!we!follow!his!run,!we!can!see!him!becoming!the!primary!target!at!the!edge!of!the!penalty!area.!!
Interestingly,!the!plot!reveals!he!was!always!in!a!highly!threatening!position!for!the!entirety!of!the!move!by!
playing!on!the!shoulder!of!the!last!defender.!
Passer'
Pass'Total'
Games'Played'Together'
Pass'Per'90'
Alex"Sandro"
31"
4"
7.23"
Matuidi"
22"
4"
5.31"
Dybala"
22"
4"
5.29"
Supplier'
Ronaldo'Target'!
First'Half'
Ronaldo'Target'!
Second'Half'
Paulo"Dybala"
1"
5"
Blaise"Matuidi"
1"
0"
Alex"Sandro"
3"
3"
12
-
=%>:'#-OL-F#5,(-Example!of!Ronaldo!(7)!making!an!active!off!ball!run!to!become!the!most!likely!receiver.!!His!
trail!shows!both!how!dangerous!his!run!was!and!how!his!xReceiver!increased.!"%>*,(-Ronaldo!being!targeted!
with!combined!man!orientated!and!ball!orientated!defending.!LINK!TO!VIDEO!!!
#
-
=%>:'#-PL-Active! Run! Maps! for! Lazio’s!(Ciro!Immobile!and!Luis! Alberto)!and! Juventus’! (Paulo! Dybala!and!
Christiano!Ronaldo)!main!attackers.--!
13
"
4.#Summary#
#
We!proposed!a!novel!GNN!architecture!that!allows!to!deal!with!unstructured!data.!The!GNN!allows!us!to!
directly!learn!from!a!lightweight!feature!representation!the!collective!behaviour!of!the!soccer!players!on!the!
pitch.!!This!circumvents!the!need!to!order!players!and!for!heavy!feature!crafting!which!allows!us!to!apply!an!
end-to-end!inference!pipeline!in!real-time!applications.!
!
Based!on!the!model!outputs,!we!introduced!Defensive!Impact,!a!toolbox!to!measure!the!influence!of!defensive!
strategy! on! the! opposition! at! a! team! and! player! level.! ! Disruption! Maps! create! a! compact! visual!
representation!of!a!team’s!effect!on!the!oppositions!xThreat,!xPass!and!xReceiver!values!compared!to!their!
global!overage.! ! It! allows! users! to! determine! where! a! defence! has!been! successful! or!struggled! based! on!
reducing/increasing!an!oppositions!threat,!pass!risk!and!player!availability.!!
!
We!were!able!to!identify!different!defensive!styles!(man!and!ball!orientated!defending)!and!off!ball!runs.!!Due!
to!the!lack!of!labelled!data!we!utilised!a!task!programming!approach!–!creating!labelling!functions!based!on!
a!domain!expert’s!insights.!Moving!forward!an!active!learning!approach!where!labels!are!generated,!trained!
against!and!then!assessed!would!be!a!recommended!next!step.!!
!
This!research!enables!the!evaluation!of!defensive!behaviour!and!provides!tools!and!insights!for!coaches!and!
fans!to!engage!with.!!
!
!
"
"
" "
14
"
5.#References#
#
[1]!Xinyu!W.,!Long!S.,!Lucey,!P.,!Morgan,!S.,!&!Sridharan,!S..!Large-scale!analysis!of!formations!in!soccer.!In!
Digital!Image!Computing:!Techniques!and!Applications!(DICTA),!2013!International!Conference!on,!
pages!18.!IEEE,!2013.!
[2]!Power,!P.,!Ruiz,!H.,!Wei,!X.,!&!Lucey,!P.!(2017).!“Not!all!passes!are!created!equal:”!Objectively!measuring!
the!risk!and!reward!of!passes!in!soccer!from!tracking!data.!Proceedings'of'the'ACM'SIGKDD'
International'Conference'on'Knowledge'Discovery'and'Data'Mining.!
[3]!Fernández,!J.,!&!Bornn,!L.!(2020).!SoccerMap:!A!Deep!Learning!Architecture!for!Visually-Interpretable!
Analysis!in!Soccer.!European'Conference'on'Machine'Learning'and'Principles'and'Practice'of'Knowledge'
Discovery'in'Databases'(ECML-PKDD).!
[4]!Brefeld,!U.,!Lasek,!J.,!&!Mair,!S.!(2019).!Probabilistic!movement!models!and!zones!of!control.!Machine'
Learning,!108(1),!127147.!https://doi.org/10.1007/s10994-018-5725-1!
[5]!Kim,!K.,!Grundmann,!M.,!Shamir,!A.,!Matthews,!I.,!Hodgins,!J.,!&!Essa,!I.!(2010).!Motion!fields!to!predict!
play!evolution!in!dynamic!sport!scenes.!Proceedings'of'the'IEEE'Computer'Society'Conference'on'
Computer'Vision'and'Pattern'Recognition,!840847!
[6]!Cervone,!D.,!D’amour,!A.,!Bornn,!L.,!&!Goldsberry,!K.!(2014).!POINTWISE:!Predicting!Points!and!Valuing!
Decisions!in!Real!Time!with!NBA!Optical!Tracking!Data.!MIT'Sloan'Sports'Analytics'Conference.!
[7]!Wei,!X.,!Lucey,!P.,!Morgan,!S.,!Reid,!M.,!&!Sridharan,!S.!(2016).!“The!Thin!Edge!of!the!Wedge”:!Accurately!
Predicting!Shot!Outcomes!in!Tennis!using!Style!and!Context!Priors.!MIT'Sloan'Sports'Analytics'
Conference.!
[8]!Wei,!X.,!Lucey,!P.,!Vidas,!S.,!Morgan,!S.,!&!Sridharan,!S.!(2015).!Forecasting!events!using!an!augmented!
hidden!conditional!random!field.!Lecture'Notes'in'Computer'Science'(Including'Subseries'Lecture'Notes'
in'Artificial'Intelligence'and'Lecture'Notes'in'Bioinformatics),!9006,!569582.!
https://doi.org/10.1007/978-3-319-16817-3_37!
[9]!Spearman,!W.!(2019).!Beyond!Expected!Goals.!MIT'Sloan'Sports'Analytics'Conference.!
[10]!Franks,!A.,!Miller,!A.,!Bornn,!L.,!&!Goldsberry,!K.!(2015).!Counterpoints:!Advanced!Defensive!Metrics!
for!NBA!Basketball.!MIT'Sloan'Sports'Analytics'Conference.!!
[11]!Seidl,!T.,!Cherukumudi,!A.,!Hartnett,!A.,!Carr,!P.,!&!Lucey,!P.!(2018).!Bhostgusters:!Realtime!Interactive!
Play!Sketching!with!Synthesized!NBA!Defenses.!MIT'Sloan'Sports'Analytics'Conference.!
[12]!Le,!H.!M.,!Carr,!P.,!Yue,!Y.,!&!Lucey,!P.!(2017).!Data!Driven!Ghosting!using!Deep!Imitation!Learning.!MIT'
Sloan'Sports'Analytics'Conference.!
[13]!Battaglia,!P,!et!al.!(2018).!Relational!inductive!biases,!deep!learning,!and!graph!networks.!'
[14]!https://www.espn.com/soccer/report?gameId=522613!!
[15]!https://www.kicker.de/lazio-gegen-juventus-2019-serie-a-4509058/spielbericht!(in German)!
[16]!https://spielverlagerung.com/2014/07/07/counterpressing-variations/
[17]!Sun,!J.!J.,!Kennedy,!A.,!Eric,!Z.,!Yue,!Y.,!&!Perona,!P.!(2020).!Task!Programming:!Learning!Data!Efficient!
Behavior!Representations.!arXiv'Preprint,!(2011.13917).!
[18]!Horton,!M.!(2020).!Learning!Feature!Representations!from!Football!Tracking.!MIT'Sloan'Sports'
Analytics'Conference.!
[19]!Mehrasa,!N.,!Zhong,!Y.,!Tung,!F.,!Bornn,!L.,!Mori,!G.!(2018)!Deep!Learning!of!Player!Trajectory!
Representations!for!Team!Activity!Analysis.!MIT'Sloan'Sports'Analytics'Conference!
#
15
A. Appendix#
#
#A1.#Speed#of#Inference#
Our!three!GNN!models!use!the!same!features!and!the!number!of!features!is!rather!small!as!described!above.!
This!raised!our!interest!whether!these!models!are!applicable!in!real-time!or!at!least!close!to!real-time.!We!
ran!a!test!how!long!the! feature! engineering! (custom! python! code)! and!a!GNN!model!inference!of!a!single!
frame!take!on!an!off-the-shelf!notebook
3
,!respectively.!Table!2!shows!that!on!average!the!whole!process!from!
crafting!the!features!until!the!inference!of!the!model!is!made!took!less!than!0.05s!for!one!tracking!timestamp!
on!our!test!notebook.!This!enables!us!to!use!our!GNN!models!nearly!in!real-time.!!
!
!
Table!A.1:'Speed'tests'how'long'feature'crafting'and'the'model'inference'takes'in'seconds'[s];'descriptive'results'
of'1000'attempts'
!
=#+,:'#-$'+5,%6>-Q/R!
765#'#6$#-Q/R!
);,+0-Q/R!
mean!(s)!
0.044!
0.001!
0.045!
std!
0.004!
0.0002!
0.006!
min!
0.038!
0.0007!
0.04!
max!
0.068!
0.0029!
0.08!
-
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
#
3
MacBookPro with a 2.3 GHz Intel Core i5 processor and 16GB RAM
16
#
#
A2.#Spatial#identity#maps#for#all#teams#from#Serie#A#season#2018/19#ordered#by#league#table#
#
#
=%>:'#-2L?(-SSpatial!Identity!for!xReceiver”!for!all!teams!from!Serie!A!2018/19.!Teams!playing!from!left!to!
right.- -
17
-
-
-
#
-
=%>:'#-2LT(-SSpatial!Identity!for!xPass”!for!all!teams!from!Serie!A!2018/19.!Teams!playing!from!left!to!right.
- -
18
-
-
#
-
=%>:'#-2LD(-SSpatial!Identity!for!xThreat”!for!all!teams!from!Serie!A!2018/19.!Teams!playing!from!left!to!
right.- -
19
#
A3.#Sample#pseudo#code#for#a#programmatic#labelling#function#
#
Based!on!discussions!with!domain!experts!the!task!of!identifying!man!orientated!defending!was!translated!
into!the!following!pseudo!code:!
!
Defender!moves!closer!to!the!primary!target!to!reduce!the!probability!of!them!being!a!receiver.”!
!
=%>:'#-2LK(-Pseudo!code!for!programmatic!labelling!of!man'orientated'defending.-
... Graphs for sports analytics Graphs have been widely used for modeling team ball sports as they represent the players as nodes of a network and their interactions as edges, for example in water polo [36], volleyball [37], soccer [1,42], etc. Graphs can capture patterns of a game at individual, group, and global level [7]. As an example of the global level, in [37] it is presented a deep architecture that models volleyball players and their individual actions to predict team activities using recursive networks and spatiotemporal attention mechanisms. ...
... At the group level, in [1] it is proposed a graph model that detects tactical patterns from soccer games videos by tracking each player and using variational autoencoders. At the individual level, in [42] defensive quality of individual soccer players is measured by using pitch-player passing graph networks. Since our graph-based model only relies on visual information, our approach involves locating the players in the pitch and classifying them in comparison with [1]. ...
Preprint
Full-text available
Action spotting in soccer videos is the task of identifying the specific time when a certain key action of the game occurs. Lately, it has received a large amount of attention and powerful methods have been introduced. Action spotting involves understanding the dynamics of the game, the complexity of events, and the variation of video sequences. Most approaches have focused on the latter, given that their models exploit the global visual features of the sequences. In this work, we focus on the former by (a) identifying and representing the players, referees, and goalkeepers as nodes in a graph, and by (b) modeling their temporal interactions as sequences of graphs. For the player identification, or player classification task, we obtain an accuracy of 97.72% in our annotated benchmark. For the action spotting task, our method obtains an overall performance of 57.83% average-mAP by combining it with other audiovisual modalities. This performance surpasses similar graph-based methods and has competitive results with heavy computing methods. Code and data are available at https://github.com/IPCV/soccer_action_spotting.
... Graphs for sports analytics. Graphs have been widely used for modeling team ball sports as they represent the players as nodes of a network and their interactions as edges, for example in water polo [36], volleyball [37], soccer [1,42], etc. Graphs can capture patterns of a game at individual, group, and global level [7]. As an example of the global level, in [37] it is presented a deep architecture that models volleyball players and their individual actions to predict team activities using recursive networks and spatio-temporal attention mechanisms. ...
... At the group level, in [1] it is proposed a graph model that detects tactical patterns from soccer games videos by tracking each player and using variational autoencoders. At the individual level, in [42] defensive quality of individual soccer players is measured by using pitch-player passing graph networks. Since our graph-based model only relies on visual information, our approach involves locating the players in the pitch and classifying them in comparison with [1]. ...
... In order to make the predicted result of the pass more realistic, [97] predicts the result of the final pass by introducing two concepts: the time to intercept and the time to control. Specifically, the time to intercept is used to calculate whether the player can intercept the ball on the trajectory of the ball. ...
Article
Due to ever-growing soccer data collection approaches and progressing artificial intelligence (AI) methods, soccer analysis, evaluation, and decision-making have received increasing interest from not only the professional sports analytics realm but also the academic AI research community. AI brings game-changing approaches for soccer analytics where soccer has been a typical benchmark for AI research. The combination has been an emerging topic. In this paper, soccer match analytics are taken as a complete observation-orientation-decision-action (OODA) loop. In addition, as in AI frameworks such as that for reinforcement learning, interacting with a virtual environment enables an evolving model. Therefore, both soccer analytics in the real world and virtual domains are discussed. With the intersection of the OODA loop and the real-virtual domains, available soccer data, including event and tracking data, and diverse orientation and decision-making models for both real-world and virtual soccer matches are comprehensively reviewed. Finally, some promising directions in this interdisciplinary area are pointed out. It is claimed that paradigms for both professional sports analytics and AI research could be combined. Moreover, it is quite promising to bridge the gap between the real and virtual domains for soccer match analysis and decision-making.
... Another approach consists of using sequential generative models of multiagent game dynamics (e.g., basketball) that explicitly model -and therefore permit inferring from data -'cognitive' constructs that have interpretable semantics, such as individual goals and group strategies [195]. It is interesting to stress that this, as with other similar approaches, has the potential to address novel questions and characterize subtle aspects of the game, including decision-making strategies at the individual level [38], off-ball positioning strategies at the team level [196,197], and defensive techniques and performance [198]. It then appears clear that recent advances in sport analytics can be reused to address general research questions about cognitive processing, particularly for characterizing performance and decision strategies (at both individual and collective levels) in rich environments involving the contextconstrained interactions of multiple agents. ...
Article
Psychology and neuroscience are concerned with the study of behavior, of internal cognitive processes, and their neural foundations. However, most laboratory studies use constrained experimental settings that greatly limit the range of behaviors that can be expressed. While focusing on restricted settings ensures methodological control, it risks impoverishing the object of study: by restricting behavior, we might miss key aspects of cognitive and neural functions. In this article, we argue that psychology and neuroscience should increasingly adopt innovative experimental designs, measurement methods, analysis techniques and sophisticated computational models to probe rich, ecologically valid forms of behavior, including social behavior. We discuss the challenges of studying rich forms of behavior as well as the novel opportunities offered by state-of-the-art methodologies and new sensing technologies, and we highlight the importance of developing sophisticated formal models. We exemplify our arguments by reviewing some recent streams of research in psychology, neuroscience and other fields (e.g., sports analytics, ethology and robotics) that have addressed rich forms of behavior in a model-based manner. We hope that these "success cases" will encourage psychologists and neuroscientists to extend their toolbox of techniques with sophisticated behavioral models - and to use them to study rich forms of behavior as well as the cognitive and neural processes that they engage.
... GNNs leverage their ability to unravel these convoluted relationships, allowing for greater insights into the structure and behavior of these networks. This approach has garnered attention across various industrial sectors including robotics and autonomous systems [247], [248], [249], utility plants [49], public services [250], and sports analytics [251], [252], [253], [254], expanding the breadth of IoT applications. With a consistent track record of state-of-the-art results [45], GNNs have proven integral in numerous IoT applications, underpinning our understanding of these increasingly complex systems. ...
Preprint
Full-text available
Time series are the primary data type used to record dynamic system measurements and generated in great volume by both physical sensors and online processes (virtual sensors). Time series analytics is therefore crucial to unlocking the wealth of information implicit in available data. With the recent advancements in graph neural networks (GNNs), there has been a surge in GNN-based approaches for time series analysis. Approaches can explicitly model inter-temporal and inter-variable relationships, which traditional and other deep neural network-based methods struggle to do. In this survey, we provide a comprehensive review of graph neural networks for time series analysis (GNN4TS), encompassing four fundamental dimensions: Forecasting, classification, anomaly detection, and imputation. Our aim is to guide designers and practitioners to understand, build applications, and advance research of GNN4TS. At first, we provide a comprehensive task-oriented taxonomy of GNN4TS. Then, we present and discuss representative research works and, finally, discuss mainstream applications of GNN4TS. A comprehensive discussion of potential future research directions completes the survey. This survey, for the first time, brings together a vast array of knowledge on GNN-based time series research, highlighting both the foundations, practical applications, and opportunities of graph neural networks for time series analysis.
... Specifically, highly flexible teams may be able to (re)organize their defensive structures more appropriately, in response to the offensive tactical strategy of the opposition (Tenga et al., 2010a(Tenga et al., , 2010b or a match score situation (Santos et al., 2017), as they can adopt a wider range of playing styles. It is possible that highly flexible teams may be able to achieve greater defensive match performance indicators by deliberately transitioning into a playing style that constrains the attacking team, consequently forcing the opposition to play more predictably in the offensive phase (i.e., forcing opposition into the adoption of a playing style, involuntarily) (Stöckl et al., 2021). However, despite the significant association between playing style flexibility and match performance indicators highlighted in the current study, the effect sizes were small to moderate. ...
Article
Purpose The present study examined the relationship between playing style adaptability and team match performance indicators throughout the season. Three playing style adaptability metrics were analysed, namely, (1) flexibility (i.e., exhibiting a wide range of playing styles), (2) reactivity (i.e., adapting playing style based on opposition) and (3) imposition (i.e., executing predetermined playing style regardless of opposition). Methods Team playing styles were derived through a clustering analysis of 21,708 matches played in the top five male European leagues from 2014/15 to 2019/20. Spearman’s correlation was utilized to assess the association between the three playing style adaptability metrics and four team match performance indicators (e.g., shots taken in opposition penalty box; shots conceded in own penalty box; goals scored; goals conceded; and total wins). Results Playing style flexibility was positively associated with both offensive and defensive match performance indicators and win frequency. Conversely, playing style reactivity and imposition were negatively associated with these team match performance indicators. Conclusions Our results suggest that the capacity to exhibit a wide range of playing styles throughout a season is associated with greater team performance. Furthermore, it is possible that high performing teams are capable of functionally switching between playing style reactivity and imposition, depending on match dynamics.
... In these cases, there were differences in the fourteen variables that had a greater contribution to shaping the outcome of their matches depending on the philosophy of their coach and the tactical principles they adopted. For example, Liverpool manager Jurgen Klopp's preference for the "high press" is well known [62][63][64]. This is reflected in the results of our research, since three of the fourteen variables (high pressing percent, ball recoveries in opponent's half, ratio defensive challenges attacking 3rd plus defensive challenges midfield 3rd per defensive challenges) for Liverpool are related to this particular philosophy, while for the teams as a whole only one of them appeared. ...
Article
Full-text available
Understanding the performance indicators that contribute to the final score of a football match is crucial for directing the training process towards specific goals. This paper presents a pipeline for identifying key team-level performance variables in football using explainable ML techniques. The input data includes various team-specific features such as ball possession and pass behaviors, with the target output being the average scoring performance of each team over a season. The pipeline includes data preprocessing, sequential forward feature selection, model training, prediction, and explainability using SHapley Additive exPlanations (SHAP). Results show that 14 variables have the greatest contribution to the outcome of a match, with 12 having a positive effect and 2 having a negative effect. The study also identified the importance of certain performance indicators, such as shots, chances, passing, and ball possession, to the final score. This pipeline provides valuable insights for coaches and sports analysts to understand which aspects of a team's performance need improvement and enable targeted interventions to improve performance. The use of explainable ML techniques allows for a deeper understanding of the factors contributing to the predicted average team score performance.
... These have also been extended in (Merhej et al. 2021) which aims to value defensive actions based on the predicted xT of what has been stopped. Similarly, (Stöckl et al. 2021) looks to better understand defenders performances using graph convolutional networks and (Llana, Madrero, and Fernández 2020) aims to create more off-ball metrics for exploiting an opponent's spatial weaknesses. Finally, (Van Roy et al. 2021) aims to evaluate the decisions made by players in games and if they were optimal. ...
Thesis
The Sports Analytics Market is growing rapidly, in 2020 it was valued at over $1 billion and is expected to reach over $5 billion by 2026. However, even with this level of growth the use of Artificial Intelligence (AI) techniques have yet to fully be explored. The sports analytics domain presents a number of significant computational challenges for AI and Machine Learning. In this thesis, we propose a number novel methods for analysing team sports data to help sports teams utilise AI to improve their strategic and tactical decision making. By doing so, we present a number of contributions to the AI and sports analytics communities. In particular, we present a model for the tactical decisions that are made in football and show how game theoretic techniques can be used to optimise these. We focus on both the short-term decisions made for individual games, as well as longer term decisions to maximise performance over a season. We show that we can increase a teams chances of winning individual games by 16.1% and can increase a teams mean expected finishing position by up to 35.6%. We also, introduce a new model for valuing the teamwork between players in sports teams by assessing the outcomes of chains of interactions between the players in a team. We then present a novel model for forming teams based on this value and maximise teamwork by assessing the overlapping pairs in a team. Our model is shown to better predict the real-world performance of teams by up to 46% compared to models that ignore inter-agent interactions. Finally, we show how we can use natural language processing techniques to improve the traditional statistical methods for prediction sports match outcomes. We use domain expert written articles from the media to train our models and we show that by incorporating the features learned from the text, we can boost the accuracy of the traditional statistical methods by 6.9%.
... basketball and football), taking all player interactions into consideration. On static data, Stöckl et al. (2021) used GNN's to predict future events in football via nodepredictions. Dick et al. (2021) were the first to present edge predictions on football data using a graph recurrent neural networks (GRNN) in order to perform classification tasks on player interactions. ...
Article
Choosing the right defensive corner-strategy is a crucial task for each coach in professional football (soccer). Although corners are repeatable and static situations, due to their low conversion rates, several studies in literature failed to find useable insights about the efficiency of various corner strategies. Our work aims to fill this gap. We hand-label the role of each defensive player from 213 corners in 33 matches, where we then employ an augmentation strategy to increase the number of data points. By combining a convolutional neural network with a long short-term memory neural network, we are able to detect the defensive strategy of each player based on positional data. We identify which of seven well-established roles a defensive player conducted (player-marking, zonal-marking, placed for counterattack, back-space, short defender, near-post, and far-post). The model achieves an overall weighted accuracy of 89.3%, and in the case of player-marking, we are able to accurately detect which offensive player the defender is marking 80.8% of the time. The performance of the model is evaluated against a rule-based baseline model, as well as by an inter-labeller accuracy. We demonstrate that rules can also be used to support the labelling process and serve as a baseline for weak supervision approaches. We show three concrete use-cases on how this approach can support a more informed and fact-based decision making process.
Article
Applying deep reinforcement learning to football games has recently received extensive attention. However, this remains challenging due to the excessively high complexity of the football environment such as high-dynamical game states, sparse rewards, and multiple roles with different capabilities. Existing works aim to address these problems without considering abundant domain knowledge of football. In this paper, a football knowledge-embedded learning framework is proposed. Specifically, the pitch control concept is innovatively introduced to design a knowledge-embedded state representation. As a result, a novel pitch control model is designed that quantitatively provides space influence values of a single player, the whole team, and the ball. Different from existing models, this model additionally considers each player's various capabilities, including flexibility, explosive force, and stamina. Furthermore, the deformable convolution network is adopted for state representation extracting, which is used to process the geometric transformation of the players' positions and spatial influence values generated by the pitch control model. Then, based on this comprehensive state representation, a PPO-based reinforcement learning scheme is adopted to generate the final policy. Finally, extensive simulations, including learning against a fixed opponent and learning from self-play, clearly show the effectiveness and adaptability of our proposed framework.
Article
Full-text available
Coordinated movements of players are key to success in team sports. However, traditional models for player movements are based on unrealistic assumptions and their analysis is prone to errors. As a remedy, we propose to estimate individual movement models from positional data and show how to turn these estimates into accurate and realistic zones of control. Our approach accounts for characteristic traits of players, scales with large amounts of data, and can be efficiently computed in a distributed fashion. We report on empirical results.
Conference Paper
Full-text available
In highly dynamic and adversarial domains such as sports, short-term predictions are made by incorporating both local immediate as well global situational information. For forecasting complex events, higher-order models such as Hidden Conditional Random Field (HCRF) have been used to good effect as capture the long-term, high-level semantics of the signal. However, as the prediction is based solely on the hidden layer, fine-grained local information is not incorporated which reduces its predictive capability. In this paper, we propose an “augmented-Hidden Conditional Random Field” (a-HCRF) which incorporates the local observation within the HCRF which boosts it forecasting performance. Given an enormous amount of tracking data from vision-based systems, we show that our approach outperforms current state-of-the-art methods in forecasting short-term events in both soccer and tennis. Additionally, as the tracking data is long-term and continuous, we show our model can be adapted to recent data which improves performance.
Conference Paper
In soccer, the most frequent event that occurs is a pass. For a trained eye, there are a myriad of adjectives which could describe this event (e.g., "majestic pass", "conservative" to "poor-ball"). However, as these events are needed to be coded live and in real-time (most often by human annotators), the current method of grading passes is restricted to the binary labels 0 (unsuccessful) or 1 (successful). Obviously, this is sub-optimal because the quality of a pass needs to be measured on a continuous spectrum (i.e., 0 to 100%) and not a binary value. Additionally, a pass can be measured across multiple dimensions, namely: i) risk -- the likelihood of executing a pass in a given situation, and ii) reward -- the likelihood of a pass creating a chance. In this paper, we show how we estimate both the risk and reward of a pass across two seasons of tracking data captured from a recent professional soccer league with state-of-the-art performance, then showcase various use cases of our deployed passing system.
Conference Paper
Due to the demand for better and deeper analysis in sports, organizations (both professional teams and broadcasters) are looking to use spatiotemporal data in the form of player tracking information to obtain an advantage over their competitors. However, due to the large volume of data, its unstructured nature, and lack of associated team activity labels (e.g. strategic/tactical), effective and efficient strategies to deal with such data have yet to be deployed. A bottleneck restricting such solutions is the lack of a suitable representation (i.e. ordering of players) which is immune to the potentially infinite number of possible permutations of player orderings, in addition to the high dimensionality of temporal signal (e.g. a game of soccer last for 90 mins). Leveraging a recent method which utilizes a "role-representation", as well as a feature reduction strategy that uses a spatiotemporal bilinear basis model to form a compact spatiotemporal representation. Using this representation, we find the most likely formation patterns of a team associated with match events across nearly 14 hours of continuous player and ball tracking data in soccer. Additionally, we show that we can accurately segment a match into distinct game phases and detect highlights. (i.e. shots, corners, free-kicks, etc) completely automatically using a decision-tree formulation.
Conference Paper
Videos of multi-player team sports provide a challenging domain for dynamic scene analysis. Player actions and interactions are complex as they are driven by many factors, such as the short-term goals of the individual player, the overall team strategy, the rules of the sport, and the current context of the game. We show that constrained multi-agent events can be analyzed and even predicted from video. Such analysis requires estimating the global movements of all players in the scene at any time, and is needed for modeling and predicting how the multi-agent play evolves over time on the field. To this end, we propose a novel approach to detect the locations of where the play evolution will proceed, e.g. where interesting events will occur, by tracking player positions and movements over time. We start by extracting the ground level sparse movement of players in each time-step, and then generate a dense motion field. Using this field we detect locations where the motion converges, implying positions towards which the play is evolving. We evaluate our approach by analyzing videos of a variety of complex soccer plays.
POINTWISE: Predicting Points and Valuing Decisions in Real Time with NBA Optical Tracking Data
  • D Cervone
  • A Bornn
  • L Goldsberry
Cervone, D., D'amour, A., Bornn, L., & Goldsberry, K. (2014). POINTWISE: Predicting Points and Valuing Decisions in Real Time with NBA Optical Tracking Data. MIT Sloan Sports Analytics Conference.
The Thin Edge of the Wedge": Accurately Predicting Shot Outcomes in Tennis using Style and Context Priors
  • X Wei
  • P Lucey
  • S Morgan
  • M Reid
  • S Sridharan
Wei, X., Lucey, P., Morgan, S., Reid, M., & Sridharan, S. (2016). "The Thin Edge of the Wedge": Accurately Predicting Shot Outcomes in Tennis using Style and Context Priors. MIT Sloan Sports Analytics Conference.
Relational inductive biases, deep learning, and graph networks
  • P Battaglia
Battaglia, P, et al. (2018). Relational inductive biases, deep learning, and graph networks.
Learning Feature Representations from Football Tracking
  • M Horton
Horton, M. (2020). Learning Feature Representations from Football Tracking. MIT Sloan Sports Analytics Conference.
Spatial Identity for xReceiver" for all teams from Serie A 2018/19. Teams playing from left to right
  • A Figure
Figure A.1: "Spatial Identity for xReceiver" for all teams from Serie A 2018/19. Teams playing from left to right.