SlideShare una empresa de Scribd logo
1 de 41
Descargar para leer sin conexión
Analysing the evolution of
   social aspects of open
source software ecosystems
    Tom Mens & Mathieu Goeminne
           Service de Génie Logiciel
   Département des Sciences Informatiques
   Faculté des Sciences - Université de Mons
                    Belgique
Research topic
       •   Study of software evolution
           •   in particular, the software process and software
               quality
       •   Focus on software ecosystems (see next slide)
       •   Focus on social aspects (see next slide)
       •   By analysing and visualising historical data of open source
           projects
           •   obtained by mining and combining data from
               different types of repositories


Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Goal
   • Analyse the success factors (e.g. popularity,
        quality) of OSS projects?
       •   What are good and bad practices of OSS evolution?
           What lessons can we learn?

   • Study how social aspects co-evolve with, and
        affect the software product and process
   • Exploit this knowledge to improve the current
        practice (e.g., guidelines, tool support)

Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Software ecosystem
       •   There are many views on a software ecosystem. Our focus:
           •   the ecosystem of a single project, containing all artefacts (code,
               documentation, etc.) and persons involved in using, producing or
               modifying these artefacts
           •   the ecosystem of a coherent collection or distribution of individual
               projects
       •   Example: GNOME desktop environment for Linux, containing
           hundreds of applications/project
           •   The ecosystem of each individual GNOME project can be studied
           •   The interaction between all GNOME projects and their
               contributors can be studied
           •   Other examples: Linux OS distributions (e.g. Debian, Ubuntu)

Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Social aspects
       •   Which communities (e.g. users, developers) are involved in a
           software project?
           •   relation between project quality and popularity?

       •   How do communities communicate / interact?
           •   e.g. how are developers driven by user requests and how does this influence
               the project evolution?

       •   How are communities structured?
           •   relation between community structure and software quality or maintainability?

       •   How is work distributed among persons?
           •   relation between distribution of work / responsibility and maintainability?

       •   Which processes (formal or informal) are used?


Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels           Tom Mens & Mathieu Goeminne
Why open source?
       •   Free access to source code, defect data,
           developer and user communication
       •   Historical data available in open repositories
           •   Observable communities
           •   Observable activities
       •   Increasing popularity for personal and
           commercial use
       •   A huge range of community and software
           sizes
Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Methodology
       •   Exploit available data from different repositories
           •   code repositories (e.g. SVN, Git, ...)
           •   mail repositories (mailing lists)
           •   bug repositories (bug trackers)

       •   Select open source projects
       •   Use Herdsman framework
           •   Based on FLOSSMetrics data extraction
           •   Use identity merging tool
           •   Use of statistical analysis and visualisation

Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Selecting Projects
       • Criteria for selecting projects
          •    Availability of data from repositories
          •    Data processable by FLOSSMetrics tools
              •      CVSAnaly2, MLStats, Bicho


          •    Size of considered projects: persons involved,
               code size, activity in each repository
          •    Age of considered projects
Université de Mons    Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Herdsman Framework




Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Identity Merging
Comparing Merge Algorithms
• Based on a reference merge model
  • manually created
  • iterative approach
  • relying on information contained in different
  files (COMMITTERS, MAINTAINERS, AUTHORS, NEWS, README)
• Compute, for each algorithm, precision and
recall w.r.t. reference model
Comparing Merge
                 Algorithms
       • Simple
       • Bird (code and mail repositories only)
        • based on Levenshtein distance
       • Bird, extended for bug repositories
       • Improved
        • Combining ideas from Bird and Robles
Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Comparing Merge Algorithms
  Brasero         Evince
Comparing Merge Algorithms
 (varying parameter values) - Evince
Simple                    Improved




Bird




                        Bird extended
Analysing
Activity Distribution
Analysing
           Activity distribution
     Three different longitudinal studies
     1. Distribution of work, for a single project, using coarse-grained data
        from different repositories
              How are developer activities (commits, mails, bug fixes)
              distributed across repositories? How is activity distributed
              across developers? How does this evolve over time?
     2. Distribution of work, for a single project, using fine-grained data
        from a single repository
              Based on commits of different file types in version repository
     3. Distribution of work across different projects belonging to the
        same community (e.g. GNOME)

Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Activity distribution
                First study
       • Analyse, for individual Gnome projects,
           historical data from version repositories,
           bug trackers and mailing lists
       • Focus on 3 types of activities per person:
           commits, mails, bug report changes
       • How are activities distributed: over
           different persons, over time?

Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Activity distribution
  1

0.9

0.8

0.7

0.6

0.5

0.4

0.3
                                                                                   Brasero
0.2

0.1

  0
                                                                         commits
      0     0.1   0.2   0.3   0.4   0.5   0.6     0.7    0.8   0.9   1



  1
                                                                         mails
0.9

0.8
                                                                         bug report changes
0.7



                                                                                    Evince
0.6

0.5

0.4

0.3

0.2

0.1

  0
      0     0.1   0.2   0.3   0.4   0.5   0.6      0.7   0.8   0.9   1




          Université de Mons                    Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Activity distribution
                  First study
       • Evidence of Pareto principle (20/80 rule)?
        • Most activity is carried out by a small
               group of persons
          • Typically : 20% do 80% of the job
       • Distribution of code activity more unequal
           than mail activity


Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Activity distribution
                First study
       • Analyse activity distribution over time
       • Use econometrics
          •    express inequality in a distribution
          •    aggregation metrics:
               Gini, Hoover, Theil (normalised)
          •    Values between 0 and 1
              •      0 = perfect equality; 1 = perfect inequality

Université de Mons    Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Activity distribution
                     First study
      1
              Evolution of aggregation indices for Evince
     0.9


     0.8


     0.7


     0.6


     0.5


     0.4                                                       Gini
                                                               Hoover
     0.3
                                                               Theil (normalised)
     0.2


     0.1


      0
      Apr-99 Dec-99 Aug-00 Apr-01 Dec-01 Aug-02 Apr-03 Dec-03 Aug-04 Apr-05 Dec-05 Aug-06 Apr-07 Dec-07 Aug-08



Université de Mons         Tuesday 7 June 2011, IWSECO workshop, Brussels                      Tom Mens & Mathieu Goeminne
Activity distribution
                                       First study
  $"
!#,"
!#+"                                                                                                  Evolution of Gini index
!#*"
!#)"
!#("
!#'"
!#&"
                     .4556/7"
                     58697"
                                                                                                              Brasero
!#%"                 :;"0".<8=>?7"
!#$"
  !"
  -./0!)"    1230!*"    -./0!*"    1230!+"    -./0!+"     1230!,"    -./0!,"      1230$!"   -./0$!"

  $"
!#,"
!#+"
!#*"
!#)"
!#("
!#'"                                                                     1233456"                               Evince
!#&"                                                                     37486"
!#%"                                                                     9:;"/<.2/5"1=7>;<6"
!#$"
  !"
  -./0,," -./0!!" -./0!$" -./0!%" -./0!&" -./0!'" -./0!(" -./0!)" -./0!*" -./0!+" -./0!," -./0$!"




       Université de Mons                            Tuesday 7 June 2011, IWSECO workshop, Brussels                Tom Mens & Mathieu Goeminne
Activity distribution
                First study
       • Identifying core groups
         • Display Venn diagrams of most active (top
               20) persons, according to each activity
               type (committing, mailing, bug report
               changing)
          • For each person, show percentage of
               activity attributable to this person
          • Take into account identity merges
Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Activity distribution
                                                 Identifying core groups

                                                                      Brasero




                                                                      Evince


Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels       Tom Mens & Mathieu Goeminne
Activity distribution
                First study
       • Conclusion
        • Activity distributions seem to become
              more and more unequally distributed
          • The Pareto principle is clearly present in
              studied projects
          • For Brasero and Evince, the activity is led
              by a few persons involved in 2 or 3 of the
              defined activities
Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Activity distribution
                First study
       •   Future work
           •   Identify the type of statistical distribution
           •   Use sliding window approach
               •     detect impact of personnel turnover: ignore
                     inactive persons, and discover new active
                     persons
           •   Automate detection of core groups
           •   Study the evolution of core groups over time


Université de Mons    Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Activity distribution
   Second study
Activity distribution
                           Second study
       •   Analysing fine-grained activities, restricted to
           source code repository
       •   Analyse contributors to a project based on
           types of activity they contribute to
    type of activity                                   file type
         coding                      .c, .h, .cc, .pl, .java, .ada,.cpp, .chh, .py
      documenting                         .html, .txt, .ps, .tex, .sgml, .pdf
       translation                             .po, .pot, .mo, .charset
       multimedia                            .mp3, .ogg, .wav, .au, .mid
           ...
Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Activity distribution
                            Second study
•Example: Rhythmbox
 • overlap between 5 activity types
      A = code, B = build, C = devel-doc, D = translation
• Visualised using Venn diagram
   values represent                                       A
                                                                       B
                                                                       5
                                                                                          C
                                                                                          4             D
   number of persons                                      53
                                                                 15
                                                                               6
                                                                                                   84


                                                                                                        71

   involved in a particular                                           19                  51



   activity (i.e., having                                                      19

   committed at least                                            16                            7


   one file of this type)                                                   4

                                                                               0
                                                                                      1




Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels                Tom Mens & Mathieu Goeminne
Activity distribution
                            Second study
Evince - scatter plots of correlation




                                 coding versus developer                  translation versus
 coding versus translation
                                     documentation                    developer documentation


Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels         Tom Mens & Mathieu Goeminne
Activity distribution
                 Second study
Evince - correlation matrix
Correlations with p-value > 0.01 not shown;
Medium (>0.5) to strong correlations (>0.8) are coloured
Evince       document images         i18n         ui           multimedi code            meta         config       build        devel.doc       other
             ation                                             a
documenta           1   0,14715687
tion

images                           1   0,16954996   0,20213002   0,23079965   0,41266264                 0,4596056    0,4757661   0,44918361      0,19411888


i18n                                          1                0,15575507   0,13010625                0,17912559   0,15815786   0,30906678


ui                                                         1   0,16142962   0,56632106   0,31157703   0,54728747   0,55080516   0,51066149      0,16779495


multimedia                                                              1   0,21869572                0,41933948   0,33361174   0,36632992      0,20712209


code                                                                                 1   0,30763128    0,8242121   0,90169181   0,87365497       0,4087696


meta                                                                                              1   0,31914846   0,54909454    0,2434247      0,61723848


config                                                                                                         1   0,89979553   0,95124778      0,43475402


build                                                                                                                       1   0,90540444      0,58183399


devel.doc                                                                                                                                   1   0,41281866


other                                                                                                                                                    1


Université de Mons              Tuesday 7 June 2011, IWSECO workshop, Brussels                                      Tom Mens & Mathieu Goeminne
Activity distribution
               Second study
   • Correlation graph for
      Evince                                                                           docume
                                                                                       ntation
                                                                                        2,2%




      •
                                                      code            config             images

          Nodes represent relative                   46,2%            0,8%               1,2%




          amount of activity of                                                         meta
                                                                                        1,6%

          each type

      •   Edges represent a                        build
                                                  16,1%
                                                                   devel-doc
                                                                     19%
                                                                                   i18n
                                                                                  12,6%
          statistically significant
          (p<0.01) high correlation
          (>0.8)             Edges with weak or not statistically significant correlation,
                                               and activity nodes <0,5% are not shown.
Université de Mons   IWSECO 2011, Brussels               Tom Mens & Mathieu Goeminne
Activity distribution
               Second study
   •   Evince: Interpretation of results

       •   The activities of building,                                          docume
                                                                                ntation
                                                                                 2,2%

           coding and development             code
           documentation are done by
                                                               config             images
                                                               0,8%               1,2%
                                             46,2%

           the same group of persons                                             meta
                                                                                 1,6%




   •   Translators (i18n) are also
       involved in development                build         devel-doc            i18n
       documentation but not in coding       16,1%            19%               12,6%



   •   Other documentation is largely
       independent from these activities


Université de Mons   IWSECO 2011, Brussels        Tom Mens & Mathieu Goeminne
Activity distribution
              Second study
       •   Conclusion
           •   Some activities are done by same group of persons, other
               activities are largely independent

       •   Future work
           •   Study and compare the activity patterns on other projects
           •   Study how the separation of activities influences the process
           •   Compare activity distribution of different types of
               distributions
               •     E.g. translators have a more equal distribution of work
                     than coders

Université de Mons     Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Activity distribution
               Third study
     •   Study the activity of persons across software
         projects in the GNOME ecosystem
     •   Case: Large long-lived GNOME projects using Git
                ID           Project name           Authors        Years   Files
                 A           Banshee                   268          5,9    2700
                     B      Rhythmbox                  364          8,9    937
                 C           Tomboy                    290          6,6    766
                 D               Evince                381          12,1   699
                     E          Brasero                193          4,2    797

Université de Mons       Tuesday 7 June 2011, IWSECO workshop, Brussels      Tom Mens & Mathieu Goeminne
Activity distribution
                                       Third study
                          •        How are activities distributed in different GNOME projects?
                                 •        counted in terms of number of files modified
                                 •        code files are most frequently committed, followed by
                                          build, development documentation and translation files

                                                                                                    '!!!!"
$!!"#                                                                                                                                                                                       "1?=@1-A8)""
 ,!"#                                                                                                                                                                                       "30,-:"
                                                                                     #6DBE62F=.## &!!!!"                                                                                    "?8""
 +!"#
                                                                                     #8512?#
                                                                                                                                                                                            "A39?1-*0)@3*""
 *!"#
                                                                                     #D=##
                                                                                                 %!!!!"                                                                                     "1-0)""
 )!"#                                                                                #F8>D62/5.E8/##
                                                                                                                                                                                            "81)B-+""
                                                                                     #625.##
 (!"#
                                                                                     #=6.G20##                                                                                              "93*CB""
                                                                                                    $!!!!"
 '!"#                                                                                #>8/HG##                                                                                               "8#D*""
 &!"#                                                                                #=$+/##                                                                                                "A-7-=EA39""
                                                                                     #F2<2BIF8>##   #!!!!"
 %!"#                                                                                                                                                                                       "2?8=A""
                                                                                     #7D=BF##
                                                                                                                                                                                            "93A-""
 $!"#                                                                                #>8F2##
                                                                                                        !"
  !"#
                                                                                                             ()*+,--"   .,/0,1234"   53123/"   678*9-"     (:)+-:3"   ;,30<-=="   >,--+-"
        -./0122#   314516789#   :86784#   ;<=/>2#   -?.02?8#   @185A2BB#   C12202#




        Université de Mons                          Tuesday 7 June 2011, IWSECO workshop, Brussels                                                       Tom Mens & Mathieu Goeminne
Activity distribution
               Third study
       •   How are activities distributed in different GNOME projects?
           •   counted in terms of number of persons involved in modifying files
           •   most persons are involved in translations,
               followed by development documentation,
               followed by code and build

                     ,!"#
                     +!"#
                                                                                                  A;7<?00#
                     *!"#
                     )!"#                                                                         B?C=?:1.D#
                     (!"#                                                                         E.:1.C#
                     '!"#
                     &!"#
                     %!"#
                     $!"#
                     !!"#
                                                      ##


                                                                        ##


                                                                         ##

                                                                                          ;##


                                                                                             #




                                                                                                                   ;##
                              ##


                                        ##


                                               ##




                                                                                                 3##


                                                                                                          #
                                                                                          7#




                                                                                                        0@
                            /0




                                                                      0<
                                    4/


                                             .-


                                                     +7


                                                              9




                                                                                                 #2
                                                                                        0=
                                                           78




                                                                                                                   3
                                                                                       >.




                                                                                                                0/
                                                                                                       =?
                                   23




                                                                    ;9
                                              6/
                        #-.




                                                    #3$




                                                                                     #:
                                   #1




                                                          #-.




                                                                                                       #.
                                                                                    =;




                                                                                                              >:
                                           04




                                                                #3 :




                                                                                  07
                                         05




                                                                                                              24
                                                                                2:
                                        #/




                                                                                                            #:
                                                                              .-
                                                                              #/




Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels                                                    Tom Mens & Mathieu Goeminne
Activity distribution
               Third study
       • How many persons are contributing to
           more than 1 GNOME project?
                                              B


                                                                162

                                                                                                             C
                            A                              22                    9
                                                                    18
                                              8 4
                                                                                     7                  77
                                                          11                    21       4
                            127
                                                  4                                      18

                                     15                             70                             24
                                                                                         4
                                                                                                  6
                                                      1
                                              1
                                                                                              6
                                          1                                          1
                                                          4           5

                                                                7     11
                                                                           12

                                                                                                  148
                                      26



                            E                                                                                D

Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels                                              Tom Mens & Mathieu Goeminne
Activity distribution
                   Third study
 •   Which types of activities are carried out by persons involved in multiple
     projects?
     Mainly translators. Coders tend to stick to one project
                         B                                                                                 B


                                             106                                                                            66
                                                                                          C                                                                           C
         A                               9                   0                                A                         9                 9
                                                 0
                         1 0                                                                               5 2                  17
                                                                 0                   46                                                       6                  32
                                     0                       1       0                                                                            5
          96                                                                                  37                       10                21
                             0                                       0                                         3                                  17
                10                               0                               2                 1                            69                          24
                                                                     0                                                                            4
                                                                             2                                                                             6
                                 0                                                                                 1
                         1                                                                                 1
                                                                         3                                                                             2
                     0                                           1                                     1                                      0
                                     0               0                                                                 4         5
                                             5       5                                                                      3    7
                                                         2                                                                           6

                                                                             107                                                                            49
                     6                                                                             23



         E                                                                                D                                                                           D
                                                                                              E


                                 coders                                                                translators
Université de Mons                   Tuesday 7 June 2011, IWSECO workshop, Brussels                                    Tom Mens & Mathieu Goeminne
Activity distribution
               Third study
       • Work in progress
        • Study activities of developers across
              different projects
          • How, when and why do developers
              contribute to different projects?
             • Simultaneously
             • Over time
Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne
Thank you



Université de Mons   Tuesday 7 June 2011, IWSECO workshop, Brussels   Tom Mens & Mathieu Goeminne

Más contenido relacionado

Similar a Analysing the evolution of social aspects of open source software ecosystems

The Development Infrastructure of the TYPO3 Project
The Development Infrastructure of the TYPO3 ProjectThe Development Infrastructure of the TYPO3 Project
The Development Infrastructure of the TYPO3 ProjectSteffen Gebert
 
Howison i conf-transition
Howison i conf-transitionHowison i conf-transition
Howison i conf-transitionJames Howison
 
SciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsSciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsJames Howison
 
Can I Remove This Method? How Live Feedback from the Ecosystem Supports Co-Ev...
Can I Remove This Method? How Live Feedback from the Ecosystem Supports Co-Ev...Can I Remove This Method? How Live Feedback from the Ecosystem Supports Co-Ev...
Can I Remove This Method? How Live Feedback from the Ecosystem Supports Co-Ev...ESUG
 
Software Ecosystem Evolution. It's complex!
Software Ecosystem Evolution. It's complex!Software Ecosystem Evolution. It's complex!
Software Ecosystem Evolution. It's complex!Tom Mens
 
Telecommunication Networks and integrated Services (TNS) Living Lab Presentation
Telecommunication Networks and integrated Services (TNS) Living Lab PresentationTelecommunication Networks and integrated Services (TNS) Living Lab Presentation
Telecommunication Networks and integrated Services (TNS) Living Lab PresentationEuropean Network of Living Labs (ENoLL)
 
Bug tracking à grande échelle et interopérabilité des outils de développement...
Bug tracking à grande échelle et interopérabilité des outils de développement...Bug tracking à grande échelle et interopérabilité des outils de développement...
Bug tracking à grande échelle et interopérabilité des outils de développement...olberger
 
Social Event Detection using Multimodal Clustering and Integrating Supervisor...
Social Event Detection using Multimodal Clustering and Integrating Supervisor...Social Event Detection using Multimodal Clustering and Integrating Supervisor...
Social Event Detection using Multimodal Clustering and Integrating Supervisor...Symeon Papadopoulos
 
ENoLL Networking Global User-Driven Open Innovation Ecosystems Alvaro Oliveira
ENoLL Networking Global User-Driven Open Innovation Ecosystems Alvaro OliveiraENoLL Networking Global User-Driven Open Innovation Ecosystems Alvaro Oliveira
ENoLL Networking Global User-Driven Open Innovation Ecosystems Alvaro OliveiraEuropean Network of Living Labs (ENoLL)
 
On the Globalization of Modeling Languages (June 8th, 2015)
On the Globalization of Modeling Languages (June 8th, 2015)On the Globalization of Modeling Languages (June 8th, 2015)
On the Globalization of Modeling Languages (June 8th, 2015)Benoit Combemale
 
Software Ecosystems = Big Data
Software Ecosystems = Big DataSoftware Ecosystems = Big Data
Software Ecosystems = Big DataTom Mens
 
An empirical study on the Specialisation Effect in Open Source Communities
An empirical study on the Specialisation Effect in Open Source CommunitiesAn empirical study on the Specialisation Effect in Open Source Communities
An empirical study on the Specialisation Effect in Open Source CommunitiesTom Mens
 
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...James Howison
 
FileSender, TF-Storage9, 4 Februari 2011, Budapest
FileSender, TF-Storage9, 4 Februari 2011, BudapestFileSender, TF-Storage9, 4 Februari 2011, Budapest
FileSender, TF-Storage9, 4 Februari 2011, BudapestJan Meijer
 
Apollon - 22/5/12 - 09:00 - User-driven Open Innovation Ecosystems
Apollon - 22/5/12 - 09:00 - User-driven Open Innovation EcosystemsApollon - 22/5/12 - 09:00 - User-driven Open Innovation Ecosystems
Apollon - 22/5/12 - 09:00 - User-driven Open Innovation Ecosystemsimec.archive
 
National Cheng Kung University
National Cheng Kung University National Cheng Kung University
National Cheng Kung University spikol
 
Where are we going and how are we going to get there?
Where are we going and how are we going to get there?Where are we going and how are we going to get there?
Where are we going and how are we going to get there?David De Roure
 

Similar a Analysing the evolution of social aspects of open source software ecosystems (20)

The Development Infrastructure of the TYPO3 Project
The Development Infrastructure of the TYPO3 ProjectThe Development Infrastructure of the TYPO3 Project
The Development Infrastructure of the TYPO3 Project
 
Howison i conf-transition
Howison i conf-transitionHowison i conf-transition
Howison i conf-transition
 
SciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsSciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystems
 
Seronto Process
Seronto ProcessSeronto Process
Seronto Process
 
Can I Remove This Method? How Live Feedback from the Ecosystem Supports Co-Ev...
Can I Remove This Method? How Live Feedback from the Ecosystem Supports Co-Ev...Can I Remove This Method? How Live Feedback from the Ecosystem Supports Co-Ev...
Can I Remove This Method? How Live Feedback from the Ecosystem Supports Co-Ev...
 
Software Ecosystem Evolution. It's complex!
Software Ecosystem Evolution. It's complex!Software Ecosystem Evolution. It's complex!
Software Ecosystem Evolution. It's complex!
 
Telecommunication Networks and integrated Services (TNS) Living Lab Presentation
Telecommunication Networks and integrated Services (TNS) Living Lab PresentationTelecommunication Networks and integrated Services (TNS) Living Lab Presentation
Telecommunication Networks and integrated Services (TNS) Living Lab Presentation
 
Bug tracking à grande échelle et interopérabilité des outils de développement...
Bug tracking à grande échelle et interopérabilité des outils de développement...Bug tracking à grande échelle et interopérabilité des outils de développement...
Bug tracking à grande échelle et interopérabilité des outils de développement...
 
Social Event Detection using Multimodal Clustering and Integrating Supervisor...
Social Event Detection using Multimodal Clustering and Integrating Supervisor...Social Event Detection using Multimodal Clustering and Integrating Supervisor...
Social Event Detection using Multimodal Clustering and Integrating Supervisor...
 
ENoLL Networking Global User-Driven Open Innovation Ecosystems Alvaro Oliveira
ENoLL Networking Global User-Driven Open Innovation Ecosystems Alvaro OliveiraENoLL Networking Global User-Driven Open Innovation Ecosystems Alvaro Oliveira
ENoLL Networking Global User-Driven Open Innovation Ecosystems Alvaro Oliveira
 
On the Globalization of Modeling Languages (June 8th, 2015)
On the Globalization of Modeling Languages (June 8th, 2015)On the Globalization of Modeling Languages (June 8th, 2015)
On the Globalization of Modeling Languages (June 8th, 2015)
 
Software Ecosystems = Big Data
Software Ecosystems = Big DataSoftware Ecosystems = Big Data
Software Ecosystems = Big Data
 
An empirical study on the Specialisation Effect in Open Source Communities
An empirical study on the Specialisation Effect in Open Source CommunitiesAn empirical study on the Specialisation Effect in Open Source Communities
An empirical study on the Specialisation Effect in Open Source Communities
 
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
 
WIRA brochure 2010
WIRA brochure 2010WIRA brochure 2010
WIRA brochure 2010
 
FileSender, TF-Storage9, 4 Februari 2011, Budapest
FileSender, TF-Storage9, 4 Februari 2011, BudapestFileSender, TF-Storage9, 4 Februari 2011, Budapest
FileSender, TF-Storage9, 4 Februari 2011, Budapest
 
Apollon - 22/5/12 - 09:00 - User-driven Open Innovation Ecosystems
Apollon - 22/5/12 - 09:00 - User-driven Open Innovation EcosystemsApollon - 22/5/12 - 09:00 - User-driven Open Innovation Ecosystems
Apollon - 22/5/12 - 09:00 - User-driven Open Innovation Ecosystems
 
National Cheng Kung University
National Cheng Kung University National Cheng Kung University
National Cheng Kung University
 
Linked (open) data: het met elkaar verbinden van kennis en organisaties
Linked (open) data: het met elkaar verbinden van kennis en organisatiesLinked (open) data: het met elkaar verbinden van kennis en organisaties
Linked (open) data: het met elkaar verbinden van kennis en organisaties
 
Where are we going and how are we going to get there?
Where are we going and how are we going to get there?Where are we going and how are we going to get there?
Where are we going and how are we going to get there?
 

Más de Tom Mens

How to be(come) a successful PhD student
How to be(come) a successful PhD studentHow to be(come) a successful PhD student
How to be(come) a successful PhD studentTom Mens
 
Recognising bot activity in collaborative software development
Recognising bot activity in collaborative software developmentRecognising bot activity in collaborative software development
Recognising bot activity in collaborative software developmentTom Mens
 
A Dataset of Bot and Human Activities in GitHub
A Dataset of Bot and Human Activities in GitHubA Dataset of Bot and Human Activities in GitHub
A Dataset of Bot and Human Activities in GitHubTom Mens
 
The (r)evolution of CI/CD on GitHub
 The (r)evolution of CI/CD on GitHub The (r)evolution of CI/CD on GitHub
The (r)evolution of CI/CD on GitHubTom Mens
 
Nurturing the Software Ecosystems of the Future
Nurturing the Software Ecosystems of the FutureNurturing the Software Ecosystems of the Future
Nurturing the Software Ecosystems of the FutureTom Mens
 
Comment programmer un robot en 30 minutes?
Comment programmer un robot en 30 minutes?Comment programmer un robot en 30 minutes?
Comment programmer un robot en 30 minutes?Tom Mens
 
On the rise and fall of CI services in GitHub
On the rise and fall of CI services in GitHubOn the rise and fall of CI services in GitHub
On the rise and fall of CI services in GitHubTom Mens
 
On backporting practices in package dependency networks
On backporting practices in package dependency networksOn backporting practices in package dependency networks
On backporting practices in package dependency networksTom Mens
 
Comparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
Comparing semantic versioning practices in Cargo, npm, Packagist and RubygemsComparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
Comparing semantic versioning practices in Cargo, npm, Packagist and RubygemsTom Mens
 
Lost in Zero Space
Lost in Zero SpaceLost in Zero Space
Lost in Zero SpaceTom Mens
 
Evaluating a bot detection model on git commit messages
Evaluating a bot detection model on git commit messagesEvaluating a bot detection model on git commit messages
Evaluating a bot detection model on git commit messagesTom Mens
 
Is my software ecosystem healthy? It depends!
Is my software ecosystem healthy? It depends!Is my software ecosystem healthy? It depends!
Is my software ecosystem healthy? It depends!Tom Mens
 
Bot or not? Detecting bots in GitHub pull request activity based on comment s...
Bot or not? Detecting bots in GitHub pull request activity based on comment s...Bot or not? Detecting bots in GitHub pull request activity based on comment s...
Bot or not? Detecting bots in GitHub pull request activity based on comment s...Tom Mens
 
On the fragility of open source software packaging ecosystems
On the fragility of open source software packaging ecosystemsOn the fragility of open source software packaging ecosystems
On the fragility of open source software packaging ecosystemsTom Mens
 
How magic is zero? An Empirical Analysis of Initial Development Releases in S...
How magic is zero? An Empirical Analysis of Initial Development Releases in S...How magic is zero? An Empirical Analysis of Initial Development Releases in S...
How magic is zero? An Empirical Analysis of Initial Development Releases in S...Tom Mens
 
Comparing dependency issues across software package distributions (FOSDEM 2020)
Comparing dependency issues across software package distributions (FOSDEM 2020)Comparing dependency issues across software package distributions (FOSDEM 2020)
Comparing dependency issues across software package distributions (FOSDEM 2020)Tom Mens
 
Measuring Technical Lag in Software Deployments (CHAOSScon 2020)
Measuring Technical Lag in Software Deployments (CHAOSScon 2020)Measuring Technical Lag in Software Deployments (CHAOSScon 2020)
Measuring Technical Lag in Software Deployments (CHAOSScon 2020)Tom Mens
 
SecoHealth 2019 Research Achievements
SecoHealth 2019 Research AchievementsSecoHealth 2019 Research Achievements
SecoHealth 2019 Research AchievementsTom Mens
 
SECO-Assist 2019 research seminar
SECO-Assist 2019 research seminarSECO-Assist 2019 research seminar
SECO-Assist 2019 research seminarTom Mens
 
Empirically Analysing the Socio-Technical Health of Software Package Managers
Empirically Analysing the Socio-Technical Health of Software Package ManagersEmpirically Analysing the Socio-Technical Health of Software Package Managers
Empirically Analysing the Socio-Technical Health of Software Package ManagersTom Mens
 

Más de Tom Mens (20)

How to be(come) a successful PhD student
How to be(come) a successful PhD studentHow to be(come) a successful PhD student
How to be(come) a successful PhD student
 
Recognising bot activity in collaborative software development
Recognising bot activity in collaborative software developmentRecognising bot activity in collaborative software development
Recognising bot activity in collaborative software development
 
A Dataset of Bot and Human Activities in GitHub
A Dataset of Bot and Human Activities in GitHubA Dataset of Bot and Human Activities in GitHub
A Dataset of Bot and Human Activities in GitHub
 
The (r)evolution of CI/CD on GitHub
 The (r)evolution of CI/CD on GitHub The (r)evolution of CI/CD on GitHub
The (r)evolution of CI/CD on GitHub
 
Nurturing the Software Ecosystems of the Future
Nurturing the Software Ecosystems of the FutureNurturing the Software Ecosystems of the Future
Nurturing the Software Ecosystems of the Future
 
Comment programmer un robot en 30 minutes?
Comment programmer un robot en 30 minutes?Comment programmer un robot en 30 minutes?
Comment programmer un robot en 30 minutes?
 
On the rise and fall of CI services in GitHub
On the rise and fall of CI services in GitHubOn the rise and fall of CI services in GitHub
On the rise and fall of CI services in GitHub
 
On backporting practices in package dependency networks
On backporting practices in package dependency networksOn backporting practices in package dependency networks
On backporting practices in package dependency networks
 
Comparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
Comparing semantic versioning practices in Cargo, npm, Packagist and RubygemsComparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
Comparing semantic versioning practices in Cargo, npm, Packagist and Rubygems
 
Lost in Zero Space
Lost in Zero SpaceLost in Zero Space
Lost in Zero Space
 
Evaluating a bot detection model on git commit messages
Evaluating a bot detection model on git commit messagesEvaluating a bot detection model on git commit messages
Evaluating a bot detection model on git commit messages
 
Is my software ecosystem healthy? It depends!
Is my software ecosystem healthy? It depends!Is my software ecosystem healthy? It depends!
Is my software ecosystem healthy? It depends!
 
Bot or not? Detecting bots in GitHub pull request activity based on comment s...
Bot or not? Detecting bots in GitHub pull request activity based on comment s...Bot or not? Detecting bots in GitHub pull request activity based on comment s...
Bot or not? Detecting bots in GitHub pull request activity based on comment s...
 
On the fragility of open source software packaging ecosystems
On the fragility of open source software packaging ecosystemsOn the fragility of open source software packaging ecosystems
On the fragility of open source software packaging ecosystems
 
How magic is zero? An Empirical Analysis of Initial Development Releases in S...
How magic is zero? An Empirical Analysis of Initial Development Releases in S...How magic is zero? An Empirical Analysis of Initial Development Releases in S...
How magic is zero? An Empirical Analysis of Initial Development Releases in S...
 
Comparing dependency issues across software package distributions (FOSDEM 2020)
Comparing dependency issues across software package distributions (FOSDEM 2020)Comparing dependency issues across software package distributions (FOSDEM 2020)
Comparing dependency issues across software package distributions (FOSDEM 2020)
 
Measuring Technical Lag in Software Deployments (CHAOSScon 2020)
Measuring Technical Lag in Software Deployments (CHAOSScon 2020)Measuring Technical Lag in Software Deployments (CHAOSScon 2020)
Measuring Technical Lag in Software Deployments (CHAOSScon 2020)
 
SecoHealth 2019 Research Achievements
SecoHealth 2019 Research AchievementsSecoHealth 2019 Research Achievements
SecoHealth 2019 Research Achievements
 
SECO-Assist 2019 research seminar
SECO-Assist 2019 research seminarSECO-Assist 2019 research seminar
SECO-Assist 2019 research seminar
 
Empirically Analysing the Socio-Technical Health of Software Package Managers
Empirically Analysing the Socio-Technical Health of Software Package ManagersEmpirically Analysing the Socio-Technical Health of Software Package Managers
Empirically Analysing the Socio-Technical Health of Software Package Managers
 

Último

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 

Último (20)

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 

Analysing the evolution of social aspects of open source software ecosystems

  • 1. Analysing the evolution of social aspects of open source software ecosystems Tom Mens & Mathieu Goeminne Service de Génie Logiciel Département des Sciences Informatiques Faculté des Sciences - Université de Mons Belgique
  • 2. Research topic • Study of software evolution • in particular, the software process and software quality • Focus on software ecosystems (see next slide) • Focus on social aspects (see next slide) • By analysing and visualising historical data of open source projects • obtained by mining and combining data from different types of repositories Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 3. Goal • Analyse the success factors (e.g. popularity, quality) of OSS projects? • What are good and bad practices of OSS evolution? What lessons can we learn? • Study how social aspects co-evolve with, and affect the software product and process • Exploit this knowledge to improve the current practice (e.g., guidelines, tool support) Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 4. Software ecosystem • There are many views on a software ecosystem. Our focus: • the ecosystem of a single project, containing all artefacts (code, documentation, etc.) and persons involved in using, producing or modifying these artefacts • the ecosystem of a coherent collection or distribution of individual projects • Example: GNOME desktop environment for Linux, containing hundreds of applications/project • The ecosystem of each individual GNOME project can be studied • The interaction between all GNOME projects and their contributors can be studied • Other examples: Linux OS distributions (e.g. Debian, Ubuntu) Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 5. Social aspects • Which communities (e.g. users, developers) are involved in a software project? • relation between project quality and popularity? • How do communities communicate / interact? • e.g. how are developers driven by user requests and how does this influence the project evolution? • How are communities structured? • relation between community structure and software quality or maintainability? • How is work distributed among persons? • relation between distribution of work / responsibility and maintainability? • Which processes (formal or informal) are used? Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 6. Why open source? • Free access to source code, defect data, developer and user communication • Historical data available in open repositories • Observable communities • Observable activities • Increasing popularity for personal and commercial use • A huge range of community and software sizes Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 7. Methodology • Exploit available data from different repositories • code repositories (e.g. SVN, Git, ...) • mail repositories (mailing lists) • bug repositories (bug trackers) • Select open source projects • Use Herdsman framework • Based on FLOSSMetrics data extraction • Use identity merging tool • Use of statistical analysis and visualisation Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 8. Selecting Projects • Criteria for selecting projects • Availability of data from repositories • Data processable by FLOSSMetrics tools • CVSAnaly2, MLStats, Bicho • Size of considered projects: persons involved, code size, activity in each repository • Age of considered projects Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 9. Herdsman Framework Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 11. Comparing Merge Algorithms • Based on a reference merge model • manually created • iterative approach • relying on information contained in different files (COMMITTERS, MAINTAINERS, AUTHORS, NEWS, README) • Compute, for each algorithm, precision and recall w.r.t. reference model
  • 12. Comparing Merge Algorithms • Simple • Bird (code and mail repositories only) • based on Levenshtein distance • Bird, extended for bug repositories • Improved • Combining ideas from Bird and Robles Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 13. Comparing Merge Algorithms Brasero Evince
  • 14. Comparing Merge Algorithms (varying parameter values) - Evince Simple Improved Bird Bird extended
  • 16. Analysing Activity distribution Three different longitudinal studies 1. Distribution of work, for a single project, using coarse-grained data from different repositories How are developer activities (commits, mails, bug fixes) distributed across repositories? How is activity distributed across developers? How does this evolve over time? 2. Distribution of work, for a single project, using fine-grained data from a single repository Based on commits of different file types in version repository 3. Distribution of work across different projects belonging to the same community (e.g. GNOME) Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 17. Activity distribution First study • Analyse, for individual Gnome projects, historical data from version repositories, bug trackers and mailing lists • Focus on 3 types of activities per person: commits, mails, bug report changes • How are activities distributed: over different persons, over time? Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 18. Activity distribution 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 Brasero 0.2 0.1 0 commits 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1 mails 0.9 0.8 bug report changes 0.7 Evince 0.6 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 19. Activity distribution First study • Evidence of Pareto principle (20/80 rule)? • Most activity is carried out by a small group of persons • Typically : 20% do 80% of the job • Distribution of code activity more unequal than mail activity Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 20. Activity distribution First study • Analyse activity distribution over time • Use econometrics • express inequality in a distribution • aggregation metrics: Gini, Hoover, Theil (normalised) • Values between 0 and 1 • 0 = perfect equality; 1 = perfect inequality Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 21. Activity distribution First study 1 Evolution of aggregation indices for Evince 0.9 0.8 0.7 0.6 0.5 0.4 Gini Hoover 0.3 Theil (normalised) 0.2 0.1 0 Apr-99 Dec-99 Aug-00 Apr-01 Dec-01 Aug-02 Apr-03 Dec-03 Aug-04 Apr-05 Dec-05 Aug-06 Apr-07 Dec-07 Aug-08 Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 22. Activity distribution First study $" !#," !#+" Evolution of Gini index !#*" !#)" !#(" !#'" !#&" .4556/7" 58697" Brasero !#%" :;"0".<8=>?7" !#$" !" -./0!)" 1230!*" -./0!*" 1230!+" -./0!+" 1230!," -./0!," 1230$!" -./0$!" $" !#," !#+" !#*" !#)" !#(" !#'" 1233456" Evince !#&" 37486" !#%" 9:;"/<.2/5"1=7>;<6" !#$" !" -./0,," -./0!!" -./0!$" -./0!%" -./0!&" -./0!'" -./0!(" -./0!)" -./0!*" -./0!+" -./0!," -./0$!" Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 23. Activity distribution First study • Identifying core groups • Display Venn diagrams of most active (top 20) persons, according to each activity type (committing, mailing, bug report changing) • For each person, show percentage of activity attributable to this person • Take into account identity merges Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 24. Activity distribution Identifying core groups Brasero Evince Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 25. Activity distribution First study • Conclusion • Activity distributions seem to become more and more unequally distributed • The Pareto principle is clearly present in studied projects • For Brasero and Evince, the activity is led by a few persons involved in 2 or 3 of the defined activities Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 26. Activity distribution First study • Future work • Identify the type of statistical distribution • Use sliding window approach • detect impact of personnel turnover: ignore inactive persons, and discover new active persons • Automate detection of core groups • Study the evolution of core groups over time Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 27. Activity distribution Second study
  • 28. Activity distribution Second study • Analysing fine-grained activities, restricted to source code repository • Analyse contributors to a project based on types of activity they contribute to type of activity file type coding .c, .h, .cc, .pl, .java, .ada,.cpp, .chh, .py documenting .html, .txt, .ps, .tex, .sgml, .pdf translation .po, .pot, .mo, .charset multimedia .mp3, .ogg, .wav, .au, .mid ... Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 29. Activity distribution Second study •Example: Rhythmbox • overlap between 5 activity types A = code, B = build, C = devel-doc, D = translation • Visualised using Venn diagram values represent A B 5 C 4 D number of persons 53 15 6 84 71 involved in a particular 19 51 activity (i.e., having 19 committed at least 16 7 one file of this type) 4 0 1 Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 30. Activity distribution Second study Evince - scatter plots of correlation coding versus developer translation versus coding versus translation documentation developer documentation Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 31. Activity distribution Second study Evince - correlation matrix Correlations with p-value > 0.01 not shown; Medium (>0.5) to strong correlations (>0.8) are coloured Evince document images i18n ui multimedi code meta config build devel.doc other ation a documenta 1 0,14715687 tion images 1 0,16954996 0,20213002 0,23079965 0,41266264 0,4596056 0,4757661 0,44918361 0,19411888 i18n 1 0,15575507 0,13010625 0,17912559 0,15815786 0,30906678 ui 1 0,16142962 0,56632106 0,31157703 0,54728747 0,55080516 0,51066149 0,16779495 multimedia 1 0,21869572 0,41933948 0,33361174 0,36632992 0,20712209 code 1 0,30763128 0,8242121 0,90169181 0,87365497 0,4087696 meta 1 0,31914846 0,54909454 0,2434247 0,61723848 config 1 0,89979553 0,95124778 0,43475402 build 1 0,90540444 0,58183399 devel.doc 1 0,41281866 other 1 Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 32. Activity distribution Second study • Correlation graph for Evince docume ntation 2,2% • code config images Nodes represent relative 46,2% 0,8% 1,2% amount of activity of meta 1,6% each type • Edges represent a build 16,1% devel-doc 19% i18n 12,6% statistically significant (p<0.01) high correlation (>0.8) Edges with weak or not statistically significant correlation, and activity nodes <0,5% are not shown. Université de Mons IWSECO 2011, Brussels Tom Mens & Mathieu Goeminne
  • 33. Activity distribution Second study • Evince: Interpretation of results • The activities of building, docume ntation 2,2% coding and development code documentation are done by config images 0,8% 1,2% 46,2% the same group of persons meta 1,6% • Translators (i18n) are also involved in development build devel-doc i18n documentation but not in coding 16,1% 19% 12,6% • Other documentation is largely independent from these activities Université de Mons IWSECO 2011, Brussels Tom Mens & Mathieu Goeminne
  • 34. Activity distribution Second study • Conclusion • Some activities are done by same group of persons, other activities are largely independent • Future work • Study and compare the activity patterns on other projects • Study how the separation of activities influences the process • Compare activity distribution of different types of distributions • E.g. translators have a more equal distribution of work than coders Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 35. Activity distribution Third study • Study the activity of persons across software projects in the GNOME ecosystem • Case: Large long-lived GNOME projects using Git ID Project name Authors Years Files A Banshee 268 5,9 2700 B Rhythmbox 364 8,9 937 C Tomboy 290 6,6 766 D Evince 381 12,1 699 E Brasero 193 4,2 797 Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 36. Activity distribution Third study • How are activities distributed in different GNOME projects? • counted in terms of number of files modified • code files are most frequently committed, followed by build, development documentation and translation files '!!!!" $!!"# "1?=@1-A8)"" ,!"# "30,-:" #6DBE62F=.## &!!!!" "?8"" +!"# #8512?# "A39?1-*0)@3*"" *!"# #D=## %!!!!" "1-0)"" )!"# #F8>D62/5.E8/## "81)B-+"" #625.## (!"# #=6.G20## "93*CB"" $!!!!" '!"# #>8/HG## "8#D*"" &!"# #=$+/## "A-7-=EA39"" #F2<2BIF8>## #!!!!" %!"# "2?8=A"" #7D=BF## "93A-"" $!"# #>8F2## !" !"# ()*+,--" .,/0,1234" 53123/" 678*9-" (:)+-:3" ;,30<-==" >,--+-" -./0122# 314516789# :86784# ;<=/>2# -?.02?8# @185A2BB# C12202# Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 37. Activity distribution Third study • How are activities distributed in different GNOME projects? • counted in terms of number of persons involved in modifying files • most persons are involved in translations, followed by development documentation, followed by code and build ,!"# +!"# A;7<?00# *!"# )!"# B?C=?:1.D# (!"# E.:1.C# '!"# &!"# %!"# $!"# !!"# ## ## ## ;## # ;## ## ## ## 3## # 7# 0@ /0 0< 4/ .- +7 9 #2 0= 78 3 >. 0/ =? 23 ;9 6/ #-. #3$ #: #1 #-. #. =; >: 04 #3 : 07 05 24 2: #/ #: .- #/ Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 38. Activity distribution Third study • How many persons are contributing to more than 1 GNOME project? B 162 C A 22 9 18 8 4 7 77 11 21 4 127 4 18 15 70 24 4 6 1 1 6 1 1 4 5 7 11 12 148 26 E D Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 39. Activity distribution Third study • Which types of activities are carried out by persons involved in multiple projects? Mainly translators. Coders tend to stick to one project B B 106 66 C C A 9 0 A 9 9 0 1 0 5 2 17 0 46 6 32 0 1 0 5 96 37 10 21 0 0 3 17 10 0 2 1 69 24 0 4 2 6 0 1 1 1 3 2 0 1 1 0 0 0 4 5 5 5 3 7 2 6 107 49 6 23 E D D E coders translators Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 40. Activity distribution Third study • Work in progress • Study activities of developers across different projects • How, when and why do developers contribute to different projects? • Simultaneously • Over time Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne
  • 41. Thank you Université de Mons Tuesday 7 June 2011, IWSECO workshop, Brussels Tom Mens & Mathieu Goeminne