Stirring the melting pot of the sciences:
Leading the way to interdisciplinary research
Mixing Social Science into Computer Science,
Bioinformatics and more.
Natalie Jane de Vries
Introduction - The University of Newcastle and CIBM
• The Newcastle region is the second most
populated area in the Australian state of New
South Wales (approx 510,000)
• Situated 162 km (2 hours) North of Sydney in
the Hunter Region
• University of Newcastle established: 1965
• Directors of CIBM:
Prof. Pablo Moscato and Co-director Prof.
Rodney Scott
The Centre for Bioinformatics, Biomarker Discovery and
Information-based Medicine – Background
• One of only 10 Priority Research Centres of The University
of Newcastle.
• Origin: The Newcastle Bioinformatics Initiative (2002-2006),
established through the work of Moscato and Berretta in
Computer Science
Bioinformatics
The application of Computer
Science and Information
Technology to Biology/Life
Sciences
Information-based Medicine
is a shift toward a future of
medicine that can become more
personalized, more predictive,
and ultimately more preventative
“Melting pot” of the Sciences?
• Big Data
• Data Analytics
• Consumer Insights
• Consumer Analytics
• ‘Internet of things’
• Social Media Analysis
• Clustering/subtyping/segmenting
• Ordering
• Ranking
• Optimization
• Community Detection
• Graph analysis
• Similarity Measures
• Classification
• Characterisation
• Predictive Analytics
• Etc.
Agenda
What will I talk about today?
• Part 1) General Introduction to the mixing of Computer Science,
Social Science, Marketing and Consumer Behaviour at our Centre
• Part 2) Clustering and Segmentation
– From Breast Cancer Subtypes to Consumer Behaviours to Social
Media Metrics data and more…
• Part 3) Reverse Engineering Consumer Behaviour Modelling
Constructs from Data
– We introduce the idea of functional constructs to model online
customer engagement behaviours through symbolic regression
• Part 4) Future Research Directions
– Future Directions, Aims, Conclusions and time for questions
Part 1: Computer Science and Consumer Behaviour
Research
• Increase in amount and size of consumer-related data
• Online technologies generate large datasets
• Increase in online behaviours towards brands
• Increasing importance of social media in marketing strategies
• Need for greater understanding of consumers through e.g. clustering
consumers (or objects in general) into similar groups
Part 2: Clustering and Segmentation
1. Complete graph
2. Minimum Spanning Tree
3. Select and remove edges that are not k-Nearest Neighbors
4. Final forest (a forest is a set of trees) = clusters
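The four steps above can be sketched in a few lines of Python. This is a minimal illustration, not the CIBM implementation: it assumes Euclidean distance, a fixed k supplied by the caller, and an undirected kNN rule (an edge counts as a kNN edge if either endpoint lists the other among its k nearest neighbours). All function names are hypothetical.

```python
import math


def minimum_spanning_tree(points):
    """Prim's algorithm on the complete Euclidean graph.

    Returns a set of undirected edges (i, j) with i < j.
    """
    # best[v] = (distance to the tree, tree vertex giving that distance)
    best = {j: (math.dist(points[0], points[j]), 0) for j in range(1, len(points))}
    edges = set()
    while best:
        j = min(best, key=lambda v: best[v][0])
        _, i = best.pop(j)
        edges.add((min(i, j), max(i, j)))
        for v in best:  # relax distances through the newly added vertex
            d = math.dist(points[j], points[v])
            if d < best[v][0]:
                best[v] = (d, j)
    return edges


def knn_edges(points, k):
    """Undirected kNN graph (assumed rule: either endpoint may nominate the other)."""
    n = len(points)
    edges = set()
    for i in range(n):
        others = sorted((j for j in range(n) if j != i),
                        key=lambda j: math.dist(points[i], points[j]))
        for j in others[:k]:
            edges.add((min(i, j), max(i, j)))
    return edges


def mst_knn_clusters(points, k):
    """Keep only MST edges that are also kNN edges; the resulting trees are the clusters."""
    forest = minimum_spanning_tree(points) & knn_edges(points, k)
    parent = list(range(len(points)))  # union-find over the forest edges

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in forest:
        parent[find(i)] = find(j)
    clusters = {}
    for i in range(len(points)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Toy data (invented): two well-separated groups of three points each.
toy_points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
toy_clusters = mst_knn_clusters(toy_points, k=2)
```

On the toy points, the single long MST edge joining the two groups is not a kNN edge, so removing it splits the tree into two clusters. The quadratic loops are fine for illustration only; the large-scale applications listed on this slide would need a far more efficient implementation.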
Previous (large scale) applications of the MST-kNN method:
• U.S. Stock market time series data (Inostroza-Ponta, Berretta, & Moscato, 2011)
• Yeast gene expression data (Inostroza-Ponta, Mendes, Berretta, & Moscato, 2007)
• Alzheimer’s disease data - in the order of 1 million data elements (Arefin, Mathieson, Johnstone, Berretta, & Moscato, 2012)
• Prostate cancer data (Capp et al., 2009)
• Social Media (Facebook) Metrics Data (Lucas et al. 2014)
These examples show that the methodology proposed here scales to large
datasets
Novel methodology of clustering by CIBM’s researchers: MST-kNN
Biomarker Discovery and Clustering in
Breast Cancer
• Incidence – In 2014, it is estimated that 15,270 women will be
diagnosed with breast cancer in Australia.
Molecular Subtypes
• Luminal A
• Luminal B
• HER2-enriched
• Normal-like
• Basal-like
Treatment
Not all patients need the same treatment or respond to the same treatment
• Surgery
• Radiotherapy
• Hormonal therapy
• Chemotherapy
Figure. MST-kNN clustering. METABRIC data set, PAM50 labels. Legend: Luminal A, Luminal B, Her2, Normal-like, Basal, Controls.
The MST-kNN Clustering Method in Consumer Behaviour Research
Customer Engagement Behaviours: behavioural manifestations
of Customer Engagement (CE) toward a firm after and beyond
purchase (van Doorn et al. 2010)
Online Customer Engagement Survey/Questionnaire Tool
Methodological Outline
Respondents’ chosen brand categories

Category No.   Explanation                                              Percentage of sample
1              Fashion Brands                                           31.54%
2              Community, Charities, Personality and Sports Fan Pages   23.99%
3              Other Services                                           19.68%
4              Other Consumer Goods                                     8.09%
5              Hospitality (Restaurants, Cafes, Bars)                   7.28%
6              Consumer Electronics                                     7.01%
7              Automotive                                               2.43%
Methodology: Difference Meta-features
The difference between the values of two measured features may be
able to distinguish between two given categories, even when neither
feature can do so alone (De Paula et al. 2011)
Difference meta-features have previously been applied successfully
to Alzheimer’s Disease biomarker detection (De Paula et al. 2011;
Arefin et al. 2012), both in PLoS ONE.
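A small sketch of the idea, using invented toy values rather than data from the study: each feature on its own overlaps across the two classes, but their pair-wise difference does not. The helper names are hypothetical.

```python
from itertools import combinations


def difference_meta_features(rows, names):
    """Build one meta-feature per feature pair: value of feature a minus value of feature b."""
    pairs = list(combinations(range(len(names)), 2))
    meta_names = [f"{names[i]}-{names[j]}" for i, j in pairs]
    meta_rows = [[row[i] - row[j] for i, j in pairs] for row in rows]
    return meta_names, meta_rows


def ranges_overlap(a, b):
    """True if the value ranges of the two samples intersect."""
    return min(a) <= max(b) and min(b) <= max(a)


# Toy data (invented): each row is (f1, f2) for one sample.
feature_names = ["f1", "f2"]
class_a = [(1, 0), (5, 4), (9, 8)]
class_b = [(1, 3), (5, 7), (9, 11)]

meta_names, meta_a = difference_meta_features(class_a, feature_names)
_, meta_b = difference_meta_features(class_b, feature_names)

# f1 and f2 overlap across classes; the difference f1-f2 does not.
f1_overlaps = ranges_overlap([r[0] for r in class_a], [r[0] for r in class_b])
f2_overlaps = ranges_overlap([r[1] for r in class_a], [r[1] for r in class_b])
diff_overlaps = ranges_overlap([m[0] for m in meta_a], [m[0] for m in meta_b])
```

With m original features this produces m(m-1)/2 difference meta-features, so the feature space grows quadratically.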
Data preparation: Data collection and pre-processing → Meta-features: pair-wise differences → Meta-features: pair-wise products → Intra- and inter-construct relationships → Distance computation
[Figure: two panels plotting f1, f2 and the meta-feature (Meta-f) across samples, with samples grouped as Class A and Class B.]
Results: Clustering Highlights
Heterogeneous cluster?
More homogeneous cluster?
Results: Clustering and Significance Values
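kNN column: MST-kNN merged with the kNN cliques of the given size.

Data                      Rows selected        Distance metric   kNN   p-value (Wilcoxon’s test)   p-value (Kruskal-Wallis)
Original                  All                  Robust            5NN   0.021187                    0.042364
Original                  All                  Spearman          6NN   0.025987                    0.051962
Original                  All                  Robust            6NN   0.028565                    0.057117
Original                  All                  Pearson           3NN   0.030232                    0.060451
Original                  All                  Spearman          3NN   0.040661                    0.081306
Original                  All                  Euclidean         6NN   0.041232                    0.082448
Difference meta-features  ‘Intra’ constructs   Robust            3NN   0.016551                    0.033095
Difference meta-features  ‘Intra’ constructs   Robust            6NN   0.017177                    0.03434
Difference meta-features  ‘Intra’ constructs   Pearson           3NN   0.018628                    0.0372481
Difference meta-features  ‘Intra’ constructs   Pearson           6NN   0.019066                    0.038124
Difference meta-features  ‘Intra’ constructs   Pearson           5NN   0.019656                    0.039303
Difference meta-features  All                  Pearson           3NN   0.020594                    0.041180
Product meta-features     ‘Inter’ constructs   Spearman          3NN   0.016949                    0.033891
Product meta-features     ‘Inter’ constructs   Pearson           4NN   0.01757                     0.035132
Product meta-features     All                  Pearson           4NN   0.017721                    0.035433
Product meta-features     ‘Inter’ constructs   Pearson           6NN   0.01781                     0.035611
Product meta-features     ‘Inter’ constructs   Pearson           3NN   0.017816                    0.035624
Product meta-features     ‘Inter’ constructs   Robust            4NN   0.017998                    0.035988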
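The table compares clusterings built from different distance metrics. As a rough illustration of what three of those names usually mean, here is a sketch that assumes the common convention of turning a correlation r into a distance as 1 - r. The slides do not state which transformation was used, and the "Robust" metric is not defined in the slides, so it is left out. Function names are hypothetical.

```python
import math


def euclidean_distance(x, y):
    """Straight-line distance between two feature vectors."""
    return math.dist(x, y)


def pearson(x, y):
    """Pearson correlation coefficient (undefined if either vector is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson_distance(x, y):
    """Assumed convention: 0 for perfectly correlated vectors, 2 for anti-correlated."""
    return 1 - pearson(x, y)


def spearman_distance(x, y):
    """Same convention applied to rank-transformed vectors (Spearman correlation)."""
    return 1 - pearson(ranks(x), ranks(y))
```

The Spearman variant depends only on the ordering of the values, which is one reason rank-based distances are often considered for ordinal survey responses.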
Future Research Directions in this study
• Various domains and contexts to apply the novel process outlined in
this study
• Combine a study using survey data as well as ‘live’ behaviour data from
social networking sites (real-time interactions)
• Further exploration of meta-features in both survey data and ‘real’
online behaviour clustering studies; ‘differences’ meta-features in this
study yielded better results
• This study guides the development of future feature selection models
to identify groups of consumers according to higher-order characteristics.
The MST-kNN Method in Social Media Metrics Data
Engagement in Motion: Exploring Short Term Dynamics in Page-
level Social Media Metrics
Benjamin Lucas1,2, Ahmed Shamsul Arefin1,3, Natalie de Vries1,3, Regina Berretta1,3, Jamie Carlson1,2, Pablo Moscato1,3
1 The University of Newcastle, Australia
2 Newcastle Business School, Faculty of Business and Law
3 The Priority Research Centre for Bioinformatics, Biomarker Discovery and Information-Based Medicine
Part 3: Reverse Engineering Consumer Behaviour
Modelling Constructs from Data
Consumer Behaviour Modelling is usually done by
testing hypotheses that are generated from theory
For example:
Source: de Vries & Carlson 2014 – Journal of Brand Management
Items (questions) make up
one theoretical construct in
Structural Equation Modelling
(Hair et al. 2014). For example:
Symbolic Regression Analysis
Figure 2. The Figure shows the items ‘used’ by Eureqa through symbolic regression setting each of
the five ENG items as dependent variables (obtained using the whole data set).
de Vries NJ, Carlson J, Moscato P (2014) A Data-Driven Approach to Reverse Engineering Customer Engagement Models:
Towards Functional Constructs. PLoS ONE 9(7): e102768. doi:10.1371/journal.pone.0102768
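To make the idea of symbolic regression concrete, here is a toy sketch. It is not Eureqa and not the authors' procedure: instead of an evolutionary search over arbitrary formulas, it assumes a tiny fixed family of candidate expressions (a single item, or the sum, difference or product of two items) and fits y ≈ a·expr + b by least squares for each candidate, keeping the one with the lowest error. All names and data are invented for illustration.

```python
from itertools import combinations


def fit_linear(z, y):
    """Least-squares fit of y ≈ a*z + b. Returns (a, b, mean squared error)."""
    n = len(z)
    mz, my = sum(z) / n, sum(y) / n
    var = sum((v - mz) ** 2 for v in z)
    a = 0.0 if var == 0 else sum((v - mz) * (w - my) for v, w in zip(z, y)) / var
    b = my - a * mz
    mse = sum((a * v + b - w) ** 2 for v, w in zip(z, y)) / n
    return a, b, mse


def candidate_expressions(n_features):
    """Yield (name, function) pairs for the assumed expression family."""
    for i in range(n_features):
        yield f"x{i}", lambda r, i=i: r[i]
    for i, j in combinations(range(n_features), 2):
        yield f"x{i}+x{j}", lambda r, i=i, j=j: r[i] + r[j]
        yield f"x{i}-x{j}", lambda r, i=i, j=j: r[i] - r[j]
        yield f"x{i}*x{j}", lambda r, i=i, j=j: r[i] * r[j]


def toy_symbolic_regression(rows, y):
    """Return (expression name, a, b, mse) for the best-fitting candidate."""
    best = None
    for name, fn in candidate_expressions(len(rows[0])):
        a, b, mse = fit_linear([fn(r) for r in rows], y)
        if best is None or mse < best[3]:
            best = (name, a, b, mse)
    return best


# Toy survey-like data (invented): three items per respondent,
# with the target generated as 2 * x0 * x2 + 1.
rows = [(1, 2, 3), (2, 1, 5), (3, 4, 1), (4, 3, 2), (5, 5, 4), (2, 3, 3)]
target = [2 * r[0] * r[2] + 1 for r in rows]
best_name, best_a, best_b, best_mse = toy_symbolic_regression(rows, target)
```

Running the search with each item in turn as the target, and recording which other items appear in the winning expression, gives the kind of "items used" relationship that Figures 2 and 3 summarise.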
Figure 3. Data Set A – Network found as a result of the application of the model finding optimization
software on each variable as a target.
de Vries NJ, Carlson J, Moscato P (2014) A Data-Driven Approach to Reverse Engineering Customer Engagement Models:
Towards Functional Constructs. PLoS ONE 9(7): e102768. doi:10.1371/journal.pone.0102768
Inter-rater Agreement
de Vries NJ, Carlson J, Moscato P (2014) A Data-Driven Approach to
Reverse Engineering Customer Engagement Models: Towards Functional
Constructs. PLoS ONE 9(7): e102768. doi:10.1371/journal.pone.0102768
Our Future research directions
• Work on scalability of methodologies
• Improve optimisation algorithms (minimum distance, maximum
objectives, etc.)
• Meta-heuristics (Memetic Algorithms) for applications in the social
sciences
• Network alignment (complex network analysis) of consumer
behaviour networks for uncovering structure in datasets
• Proposal for an edited book on large-scale “Business and Consumer
Analytics” (Springer)
• Smart Cities Network (sensor data, optimisation of cities and their
networks)
• Digital Economy technologies
UoN and UKM
Things to remember:
• UoN is always open for research collaborations (depending on funds – we operate on a project basis)
• At CIBM we have supercomputing capacity available for large-scale projects
• In our team we have particularly strong expertise in operations research and management science
• CIBM is open to diversifying into new areas (e.g. computational social science as demonstrated today)
• As Prof. Moscato says: “Do not hesitate to throw an ‘odd-ball’. Either we could be interested, or we
could put you in touch with other collaborators and colleagues”.
Terima Kasih (Thank you)
Questions?
References
• Arefin AS, Mathieson L, Johnstone D, Berretta R, Moscato P (2012) Unveiling Clusters of RNA Transcript Pairs Associated with
Markers of Alzheimer’s Disease Progression, PLOS ONE, DOI: 10.1371/journal.pone.0045535
• Capp et al. (2009) Is there more than one proctitis syndrome? A revisitation using data from the TROG 96.01 trial, Radiotherapy
and Oncology, 90(3), 400-407
• Hair, J. F., Hult, G. T. M., Ringle, C. M. and Sarstedt, M. (2014) A Primer on Partial Least Squares Structural Equation Modeling
(PLS-SEM). Los Angeles: Sage Publications Inc.
• Inostroza-Ponta M, Mendes A, Berretta R, Moscato P (2007) An Integrated QAP-Based Approach to Visualize Patterns of Gene
Expression Similarity, Progress in Artificial Life, Lecture Notes in Computer Science, 4828, pp 156-167
• Inostroza-Ponta M, Berretta R, Moscato P (2011) QAPgrid: A Two Level QAP-Based Approach for Large-Scale Data Analysis and
Visualization, PLOS ONE, DOI: 10.1371/journal.pone.0014468
• Lucas B, Arefin AS, de Vries NJ, Berretta R, Carlson J, Moscato P (2014) Engagement in Motion: Exploring Short Term Dynamics
in Page-Level Social Media Metrics, IEEE Conference on Social Computing and Big Data and Cloud Computing (Sydney)
• de Vries NJ, Carlson J (2014) Examining the drivers and brand performance implications of customer engagement with brands in
the social media environment, Journal of Brand Management, 21, 495-515
• de Vries NJ, Carlson J, Moscato P (2014) A Data-Driven Approach to Reverse Engineering Customer Engagement Models:
Towards Functional Constructs, PLOS ONE, DOI: 10.1371/journal.pone.0102768
• de Vries NJ, Arefin AS, Moscato P (2014) Gauging Heterogeneity in Online Consumer Behaviour Data: A Proximity Graph
Approach, IEEE Conference on Social Computing and Big Data and Cloud Computing (Sydney)
• Marsden J, Budden D, Craig H, Moscato P (2013) Language Individuation and Marker Words: Shakespeare and His Maxwell's
Demon, PLOS ONE, DOI: 10.1371/journal.pone.0066813
• Naeni LM, de Vries NJ, Reis R, Arefin AS, Berretta R, Moscato P (2014) Identifying Communities of Trust and Confidence in the
Charity and Not-for-Profit Sector: A Memetic Algorithm Approach, IEEE Conference on Social Computing and Big Data and
Cloud Computing (Sydney)
• van Doorn, J., Lemon, K. N., Mittal, V., Nass, S., Pick, D., Pirner, P. and Verhoef, P. C. (2010). Customer Engagement Behavior:
Theoretical Foundations and Research Directions. Journal of Service Research, 13(3): 253-266.
APPENDIX
(Extra Slides)
New Publication
Published 7th April
2015 in PLOS ONE
N J de Vries
R Reis
P Moscato
Clustering of
consumers based on
trust and donating
behaviours in the not-
for-profit sector
Including symbolic
regression predictive
modeling for consumer
involvement with
charities
Resulting Segments of the Australian
Market
1. Non-institutionalist charity supporters
2. Resource allocation critics
3. Information-seeking financial sceptics
4. Non-questioning charity supporters
5. Non-trusting sceptics
6. Charity management believers
7. Institutionalist charity believers
http://journals.plos.org/plosone/article?id=10.1371%2Fjournal.pone.0122133
IEEE Conference paper
Methodology: Product Meta-features
The product of the values of two measured features may be
able to distinguish between two given categories, even when
neither feature can do so alone.
This study is the first to trial the
application of this idea.
Left, the values of f1 (blue) and
f2 (red) do not distinguish the
classes well but their product
(meta-feature in green) does.
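The same effect can be reproduced with a few invented numbers (not taken from the study): f1 and f2 each span overlapping ranges in both classes, while their product takes a different value in each class. The helper names are hypothetical.

```python
def product_meta_feature(rows, i, j):
    """Meta-feature formed by multiplying feature i by feature j for each sample."""
    return [row[i] * row[j] for row in rows]


def value_range(values):
    """(minimum, maximum) of a list of values."""
    return min(values), max(values)


# Toy data (invented): each row is (f1, f2) for one sample.
class_a = [(2, 6), (3, 4), (4, 3), (6, 2)]
class_b = [(1, 6), (2, 3), (3, 2), (6, 1)]

# Individual features: the ranges of the two classes overlap.
f1_ranges = value_range([r[0] for r in class_a]), value_range([r[0] for r in class_b])
f2_ranges = value_range([r[1] for r in class_a]), value_range([r[1] for r in class_b])

# Product meta-feature: the classes are cleanly separated.
product_a = product_meta_feature(class_a, 0, 1)
product_b = product_meta_feature(class_b, 0, 1)
```

Here any threshold between the two product values separates the classes, whereas no threshold on f1 or f2 alone would.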
[Figure: two panels plotting f1, f2 and the product meta-feature (Meta-f) across samples, with samples grouped as Class A and Class B.]
My publications
• A Data-Driven Approach to Reverse Engineering Customer Engagement
Models: Towards Functional Constructs (de Vries, Carlson and Moscato)
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0102768
• Examining the drivers and brand performance implications of customer
engagement with brands in the social media environment (de Vries and
Carlson): http://www.palgrave-journals.com/bm/journal/v21/n6/abs/bm201418a.html
• Gauging Heterogeneity in Online Consumer Behaviour Data: A Proximity
Graph Approach (de Vries, Arefin and Moscato)
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7034833
• Engagement in Motion: Exploring Short Term Dynamics in Page-Level Social
Media Metrics (Lucas et al)
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7034813&tag=1
• Identifying Communities of Trust and Confidence in the Charity and Not-for-
Profit Sector: A Memetic Algorithm Approach (Moslemi et al)
http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7034835&refinements%3D4251871666%26filter%3DAND%28p_IS_Number%3A7034739%29
Other Sources
First uses of ‘meta-features’:
• Differences in Abundances of Cell-Signalling Proteins in Blood Reveal Novel
Biomarkers for Early Detection Of Clinical Alzheimer's Disease (De Paula et al)
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0017481
• Unveiling Clusters of RNA Transcript Pairs Associated with Markers of Alzheimer’s
Disease Progression (Arefin et al)
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0045535
MST-kNN papers:
• An Integrated QAP-Based Approach to Visualize Patterns of Gene Expression
Similarity (Inostroza Ponta et al) http://link.springer.com/chapter/10.1007/978-3-540-76931-6_14
• kNN-MST-Agglomerative: A fast and scalable graph-based data clustering approach
on GPU (Arefin et al)
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6295143

Más contenido relacionado

La actualidad más candente

IRJET- Analysis of Rating Difference and User Interest
IRJET- Analysis of Rating Difference and User InterestIRJET- Analysis of Rating Difference and User Interest
IRJET- Analysis of Rating Difference and User InterestIRJET Journal
 
A.hybrid.recommendation.approach.for.a.tourism.system
A.hybrid.recommendation.approach.for.a.tourism.systemA.hybrid.recommendation.approach.for.a.tourism.system
A.hybrid.recommendation.approach.for.a.tourism.systembenny ribeiro
 
Demography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendationDemography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendationUmmeSalmaM1
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systemsvivatechijri
 
Natural language processing through the subtractive mountain clustering algor...
Natural language processing through the subtractive mountain clustering algor...Natural language processing through the subtractive mountain clustering algor...
Natural language processing through the subtractive mountain clustering algor...ijnlc
 
University Public Driven Applications - Big Data and Organizational Design
University Public Driven Applications - Big Data and Organizational Design University Public Driven Applications - Big Data and Organizational Design
University Public Driven Applications - Big Data and Organizational Design maria chiara pettenati
 
Analysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMMAnalysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMMIJERA Editor
 
Research paper impact evaluation for collaborative information supply chain
Research paper   impact evaluation for collaborative information supply chainResearch paper   impact evaluation for collaborative information supply chain
Research paper impact evaluation for collaborative information supply chainKenny Meesters
 
Using Low-Level Interaction Data to Explore User Behaviour in Object Based Me...
Using Low-Level Interaction Data to Explore User Behaviour in Object Based Me...Using Low-Level Interaction Data to Explore User Behaviour in Object Based Me...
Using Low-Level Interaction Data to Explore User Behaviour in Object Based Me...Jonathan Carlton
 
FACIAL AGE ESTIMATION USING TRANSFER LEARNING AND BAYESIAN OPTIMIZATION BASED...
FACIAL AGE ESTIMATION USING TRANSFER LEARNING AND BAYESIAN OPTIMIZATION BASED...FACIAL AGE ESTIMATION USING TRANSFER LEARNING AND BAYESIAN OPTIMIZATION BASED...
FACIAL AGE ESTIMATION USING TRANSFER LEARNING AND BAYESIAN OPTIMIZATION BASED...sipij
 
IRJET- Predicting Social Network Communities Structure Changes and Detection ...
IRJET- Predicting Social Network Communities Structure Changes and Detection ...IRJET- Predicting Social Network Communities Structure Changes and Detection ...
IRJET- Predicting Social Network Communities Structure Changes and Detection ...IRJET Journal
 
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...Kato Mivule
 
Multi-objective NSGA-II based community detection using dynamical evolution s...
Multi-objective NSGA-II based community detection using dynamical evolution s...Multi-objective NSGA-II based community detection using dynamical evolution s...
Multi-objective NSGA-II based community detection using dynamical evolution s...IJECEIAES
 
Behavioural Modelling Outcomes prediction using Casual Factors
Behavioural Modelling Outcomes prediction using Casual  FactorsBehavioural Modelling Outcomes prediction using Casual  Factors
Behavioural Modelling Outcomes prediction using Casual FactorsIJMER
 
Extending canonical action research model to implement social media in microb...
Extending canonical action research model to implement social media in microb...Extending canonical action research model to implement social media in microb...
Extending canonical action research model to implement social media in microb...Debashish Mandal
 
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...IRJET Journal
 

La actualidad más candente (19)

IRJET- Analysis of Rating Difference and User Interest
IRJET- Analysis of Rating Difference and User InterestIRJET- Analysis of Rating Difference and User Interest
IRJET- Analysis of Rating Difference and User Interest
 
A.hybrid.recommendation.approach.for.a.tourism.system
A.hybrid.recommendation.approach.for.a.tourism.systemA.hybrid.recommendation.approach.for.a.tourism.system
A.hybrid.recommendation.approach.for.a.tourism.system
 
Demography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendationDemography basedhybridrecommendersystemformovierecommendation
Demography basedhybridrecommendersystemformovierecommendation
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Natural language processing through the subtractive mountain clustering algor...
Natural language processing through the subtractive mountain clustering algor...Natural language processing through the subtractive mountain clustering algor...
Natural language processing through the subtractive mountain clustering algor...
 
University Public Driven Applications - Big Data and Organizational Design
University Public Driven Applications - Big Data and Organizational Design University Public Driven Applications - Big Data and Organizational Design
University Public Driven Applications - Big Data and Organizational Design
 
Analysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMMAnalysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMM
 
Research paper impact evaluation for collaborative information supply chain
Research paper   impact evaluation for collaborative information supply chainResearch paper   impact evaluation for collaborative information supply chain
Research paper impact evaluation for collaborative information supply chain
 
Using Low-Level Interaction Data to Explore User Behaviour in Object Based Me...
Using Low-Level Interaction Data to Explore User Behaviour in Object Based Me...Using Low-Level Interaction Data to Explore User Behaviour in Object Based Me...
Using Low-Level Interaction Data to Explore User Behaviour in Object Based Me...
 
FACIAL AGE ESTIMATION USING TRANSFER LEARNING AND BAYESIAN OPTIMIZATION BASED...
FACIAL AGE ESTIMATION USING TRANSFER LEARNING AND BAYESIAN OPTIMIZATION BASED...FACIAL AGE ESTIMATION USING TRANSFER LEARNING AND BAYESIAN OPTIMIZATION BASED...
FACIAL AGE ESTIMATION USING TRANSFER LEARNING AND BAYESIAN OPTIMIZATION BASED...
 
IRJET- Predicting Social Network Communities Structure Changes and Detection ...
IRJET- Predicting Social Network Communities Structure Changes and Detection ...IRJET- Predicting Social Network Communities Structure Changes and Detection ...
IRJET- Predicting Social Network Communities Structure Changes and Detection ...
 
243
243243
243
 
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...
A Comparative Analysis of Data Privacy and Utility Parameter Adjustment, Usin...
 
Multi-objective NSGA-II based community detection using dynamical evolution s...
Multi-objective NSGA-II based community detection using dynamical evolution s...Multi-objective NSGA-II based community detection using dynamical evolution s...
Multi-objective NSGA-II based community detection using dynamical evolution s...
 
Mr1480.ch4
Mr1480.ch4Mr1480.ch4
Mr1480.ch4
 
Behavioural Modelling Outcomes prediction using Casual Factors
Behavioural Modelling Outcomes prediction using Casual  FactorsBehavioural Modelling Outcomes prediction using Casual  Factors
Behavioural Modelling Outcomes prediction using Casual Factors
 
[IJCT-V3I2P30] Authors: Sunny Sharma
[IJCT-V3I2P30] Authors: Sunny Sharma[IJCT-V3I2P30] Authors: Sunny Sharma
[IJCT-V3I2P30] Authors: Sunny Sharma
 
Extending canonical action research model to implement social media in microb...
Extending canonical action research model to implement social media in microb...Extending canonical action research model to implement social media in microb...
Extending canonical action research model to implement social media in microb...
 
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
IRJET- Review on Different Recommendation Techniques for GRS in Online Social...
 

Destacado

My project for Mr. Medina's class Ancient Greece
My project for Mr. Medina's class Ancient GreeceMy project for Mr. Medina's class Ancient Greece
My project for Mr. Medina's class Ancient Greecebole9253
 
Γυναίκα Επιχειρηματίας στην εποχή της κρίσης.
Γυναίκα Επιχειρηματίας στην εποχή της κρίσης.Γυναίκα Επιχειρηματίας στην εποχή της κρίσης.
Γυναίκα Επιχειρηματίας στην εποχή της κρίσης.Iris Cremer
 
sampel representatif buku A. Arens, Alvin. Bab 14.trans
sampel representatif buku A. Arens, Alvin. Bab 14.transsampel representatif buku A. Arens, Alvin. Bab 14.trans
sampel representatif buku A. Arens, Alvin. Bab 14.transRita Alfian
 
Οι φοβίες μας. Κατανόηση και Αντιμετώπιση.
Οι φοβίες μας. Κατανόηση και Αντιμετώπιση.Οι φοβίες μας. Κατανόηση και Αντιμετώπιση.
Οι φοβίες μας. Κατανόηση και Αντιμετώπιση.Iris Cremer
 
TEJIDO CARTILAGINOSO
TEJIDO CARTILAGINOSOTEJIDO CARTILAGINOSO
TEJIDO CARTILAGINOSOBryan Salcedo
 
Nutriz.1
Nutriz.1Nutriz.1
Nutriz.1sere94
 
ANZMAC Customer Engagement Presentation
ANZMAC Customer Engagement PresentationANZMAC Customer Engagement Presentation
ANZMAC Customer Engagement PresentationNatalie de Vries
 
Cosa sono le costellazioni Familiari?
Cosa sono le costellazioni Familiari?Cosa sono le costellazioni Familiari?
Cosa sono le costellazioni Familiari?sere94
 
Inversion privada en el sector agrario PERU
Inversion privada en el sector agrario PERU Inversion privada en el sector agrario PERU
Inversion privada en el sector agrario PERU Josselyn Yajayra
 
la sociedad andina y sus aportes a la formación de la identidad peruana
la sociedad andina y sus aportes a la formación de la identidad peruanala sociedad andina y sus aportes a la formación de la identidad peruana
la sociedad andina y sus aportes a la formación de la identidad peruanaJosselyn Yajayra
 
diversidad cultural y patrimonio cultural en el perú
diversidad cultural y patrimonio cultural en el perúdiversidad cultural y patrimonio cultural en el perú
diversidad cultural y patrimonio cultural en el perúJosselyn Yajayra
 

Destacado (13)

Cyberbullying
Cyberbullying     Cyberbullying
Cyberbullying
 
My project for Mr. Medina's class Ancient Greece
My project for Mr. Medina's class Ancient GreeceMy project for Mr. Medina's class Ancient Greece
My project for Mr. Medina's class Ancient Greece
 
Γυναίκα Επιχειρηματίας στην εποχή της κρίσης.
Γυναίκα Επιχειρηματίας στην εποχή της κρίσης.Γυναίκα Επιχειρηματίας στην εποχή της κρίσης.
Γυναίκα Επιχειρηματίας στην εποχή της κρίσης.
 
sampel representatif buku A. Arens, Alvin. Bab 14.trans
sampel representatif buku A. Arens, Alvin. Bab 14.transsampel representatif buku A. Arens, Alvin. Bab 14.trans
sampel representatif buku A. Arens, Alvin. Bab 14.trans
 
Οι φοβίες μας. Κατανόηση και Αντιμετώπιση.
Οι φοβίες μας. Κατανόηση και Αντιμετώπιση.Οι φοβίες μας. Κατανόηση και Αντιμετώπιση.
Οι φοβίες μας. Κατανόηση και Αντιμετώπιση.
 
TEJIDO CARTILAGINOSO
TEJIDO CARTILAGINOSOTEJIDO CARTILAGINOSO
TEJIDO CARTILAGINOSO
 
Nutriz.1
Nutriz.1Nutriz.1
Nutriz.1
 
ANZMAC Customer Engagement Presentation
ANZMAC Customer Engagement PresentationANZMAC Customer Engagement Presentation
ANZMAC Customer Engagement Presentation
 
Cosa sono le costellazioni Familiari?
Cosa sono le costellazioni Familiari?Cosa sono le costellazioni Familiari?
Cosa sono le costellazioni Familiari?
 
Inversion privada en el sector agrario PERU
Inversion privada en el sector agrario PERU Inversion privada en el sector agrario PERU
Inversion privada en el sector agrario PERU
 
Ley general del ambiente
Ley general del ambienteLey general del ambiente
Ley general del ambiente
 
la sociedad andina y sus aportes a la formación de la identidad peruana
la sociedad andina y sus aportes a la formación de la identidad peruanala sociedad andina y sus aportes a la formación de la identidad peruana
la sociedad andina y sus aportes a la formación de la identidad peruana
 
diversidad cultural y patrimonio cultural en el perú
diversidad cultural y patrimonio cultural en el perúdiversidad cultural y patrimonio cultural en el perú
diversidad cultural y patrimonio cultural en el perú
 

Similar a "Melting Pot" of the Sciences in interdisciplinary research

Supervised Multi Attribute Gene Manipulation For Cancer
Supervised Multi Attribute Gene Manipulation For CancerSupervised Multi Attribute Gene Manipulation For Cancer
Supervised Multi Attribute Gene Manipulation For Cancerpaperpublications3
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeLizLyon
 
What's up at Kno.e.sis?
What's up at Kno.e.sis? What's up at Kno.e.sis?
What's up at Kno.e.sis? Amit Sheth
 
"Melting Pot" of the Sciences in interdisciplinary research

  • 1. Stirring the melting pot of the sciences: Leading the way to interdisciplinary research Mixing Social Science into Computer Science, Bioinformatics and more. Natalie Jane de Vries
  • 2. Introduction - The University of Newcastle and CIBM • The Newcastle region is the second most populated area in the Australian state of New South Wales (approx 510,000) • Situated 162 km (2 hours) North of Sydney in the Hunter Region • University of Newcastle established: 1965 • Directors of CIBM: Prof. Pablo Moscato and Co-director Prof. Rodney Scott
  • 3. The Centre for Bioinformatics, Biomarker Discovery and Information-based Medicine – Background • One of only 10 Priority Research Centres of The University of Newcastle. • Origin: The Newcastle Bioinformatics Initiative (2002-2006) established by the work of Moscato and Berretta in Computer Science • Bioinformatics: The application of Computer Science and Information Technology to Biology/Life Sciences • Information-based Medicine: a shift toward a future of medicine that can become more personalized, more predictive, and ultimately more preventative
  • 4. “Melting pot” of the Sciences? • Big Data • Data Analytics • Consumer Insights • Consumer Analytics • ‘Internet of things’ • Social Media Analysis • Clustering/subtyping/segmenting • Ordering • Ranking • Optimization • Community Detection • Graph analysis • Similarity Measures • Classification • Characterisation • Predictive Analytics • Etc.
  • 6. Agenda What will I talk about today? • Part 1) General Introduction to the mixing of Computer Science, Social Science, Marketing and Consumer Behaviour at our Centre • Part 2) Clustering and Segmentation – From Breast Cancer Subtypes to Consumer Behaviours to Social Media Metrics data and more… • Part 3) Reverse Engineering Consumer Behaviour Modelling Constructs from Data – We introduce the idea of functional constructs to model online customer engagement behaviours through symbolic regression • Part 4) Future Research Directions – Future Directions, Aims, Conclusions and time for questions
  • 7. Part 1: Computer Science and Consumer Behaviour Research • Increase in amount and size of consumer-related data • Online technologies generate large datasets • Increase in online behaviours towards brands • Increasing importance of social media in marketing strategies • Need for greater understanding of consumers through e.g. clustering consumers (or objects in general) into similar groups
  • 8. Part 2: Clustering and Segmentation. Novel methodology of clustering by CIBM’s researchers: MST-kNN. Steps: (1) complete graph; (2) Minimum Spanning Tree; (3) select and remove edges that are not k-Nearest Neighbors; (4) final forest (a forest is a set of trees) = clusters. Previous (large scale) applications of the MST-kNN method: • U.S. Stock market time series data (Inostroza-Ponta, Berretta, & Moscato, 2011) • Yeast gene expression data (Inostroza-Ponta, Mendes, Berretta, & Moscato, 2007) • Alzheimer’s disease data, in the order of 1 million data elements (Arefin, Mathieson, Johnstone, Berretta, & Moscato, 2012) • Prostate cancer data (Capp et al., 2009) • Social Media (Facebook) Metrics Data (Lucas et al. 2014). These examples show that the methodology proposed here scales to larger datasets.
  • 9. Biomarker Discovery and Clustering in Breast Cancer • Incidence – In 2014, it is estimated that 15,270 women will be diagnosed with breast cancer in Australia. • Molecular Subtypes: Luminal A, Luminal B, HER2-enriched, Normal-like, Basal-like
  • 10. Treatment Not all patients need the same treatment or respond to the same treatment • Surgery • Radiotherapy • Hormonal therapy • Chemotherapy
  • 11. Figure. MST-kNN clustering. METABRIC data set, PAM50 labels (legend: Luminal A, Luminal B, Her2, Normal-like, Basal, Controls).
  • 12. The MST-kNN Clustering Method in Consumer Behaviour Research
  • 13. Customer Engagement Behaviours: behavioural manifestations of Customer Engagement (CE) toward a firm after and beyond purchase (van Doorn et al. 2010). Online Customer Engagement Survey/Questionnaire Tool
  • 14. Methodological Outline. Respondents’ chosen brand categories:
      Category No. | Explanation | Percentage of sample
      1 | Fashion Brands | 31.54%
      2 | Community, Charities, Personality and Sports Fan Pages | 23.99%
      3 | Other Services | 19.68%
      4 | Other Consumer Goods | 8.09%
      5 | Hospitality (Restaurants, Cafes, Bars) | 7.28%
      6 | Consumer Electronics | 7.01%
      7 | Automotive | 2.43%
  • 15. Methodology: Difference Meta-features. The difference in values between two measured features may be able to distinguish between two given categories, even when those features cannot do so alone (De Paula et al. 2011). Difference meta-features have previously been applied successfully in Alzheimer’s Disease biomarker detection (De Paula et al. 2011; Arefin et al. 2012), both in PLoS ONE. Data preparation workflow: Data collection and pre-processing; Meta-features: pair-wise differences; Meta-features: pair-wise products; Intra- and inter-construct relationships; Distance computation. [Figure: two panels plotting f1, f2 and the meta-feature (Meta-f) for Class A and Class B.]
  • 17. Results: Clustering Highlights. Heterogeneous cluster? More homogeneous cluster?
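  • 18. Results: Clustering and Significance Values
      Data | Rows selected | Distance Metric | MST-kNN merged with the kNN cliques of size | p-value (Wilcoxon’s Test) | p-value (Kruskal-Wallis)
      Original | All | Robust | 5NN | 0.021187 | 0.042364
      Original | All | Spearman | 6NN | 0.025987 | 0.051962
      Original | All | Robust | 6NN | 0.028565 | 0.057117
      Original | All | Pearson | 3NN | 0.030232 | 0.060451
      Original | All | Spearman | 3NN | 0.040661 | 0.081306
      Original | All | Euclidean | 6NN | 0.041232 | 0.082448
      Difference Metafeatures | ‘Intra’ constructs | Robust | 3NN | 0.016551 | 0.033095
      Difference Metafeatures | ‘Intra’ constructs | Robust | 6NN | 0.017177 | 0.03434
      Difference Metafeatures | ‘Intra’ constructs | Pearson | 3NN | 0.018628 | 0.0372481
      Difference Metafeatures | ‘Intra’ constructs | Pearson | 6NN | 0.019066 | 0.038124
      Difference Metafeatures | ‘Intra’ constructs | Pearson | 5NN | 0.019656 | 0.039303
      Difference Metafeatures | All | Pearson | 3NN | 0.020594 | 0.041180
      Product Metafeatures | ‘Inter’ constructs | Spearman | 3NN | 0.016949 | 0.033891
      Product Metafeatures | ‘Inter’ constructs | Pearson | 4NN | 0.01757 | 0.035132
      Product Metafeatures | All | Pearson | 4NN | 0.017721 | 0.035433
      Product Metafeatures | ‘Inter’ constructs | Pearson | 6NN | 0.01781 | 0.035611
      Product Metafeatures | ‘Inter’ constructs | Pearson | 3NN | 0.017816 | 0.035624
      Product Metafeatures | ‘Inter’ constructs | Robust | 4NN | 0.017998 | 0.035988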
  • 19. Future Research Directions in this study • Various domains and contexts to apply the novel process outlined in this study • Combine a study using survey data as well as ‘live’ behaviour data from social networking sites (real-time interactions) • Further exploration of meta-features in both survey data and ‘real’ online behaviour clustering studies; ‘differences’ meta-features in this study yielded better results • This study guides the development of future feature selection models to identify groups of consumers according to higher-order characteristics.
  • 20. The MST-kNN Method in Social Media Metrics Data. Engagement in Motion: Exploring Short Term Dynamics in Page-level Social Media Metrics. Benjamin Lucas (1,2), Ahmed Shamsul Arefin (1,3), Natalie de Vries (1,3), Regina Berretta (1,3), Jamie Carlson (1,2), Pablo Moscato (1,3). (1) The University of Newcastle, Australia; (2) Newcastle Business School, Faculty of Business and Law; (3) The Priority Research Centre for Bioinformatics, Biomarker Discovery and Information-Based Medicine
  • 24. Part 3: Reverse Engineering Consumer Behaviour Modelling Constructs from Data. Consumer Behaviour Modelling is usually done by testing hypotheses that are generated from theory (for example, source: de Vries & Carlson 2014, Journal of Brand Management). Items (questions) make up one theoretical construct in Structural Equation Modelling (Hair et al. 2014).
  • 29. Figure 2. The figure shows the items ‘used’ by Eureqa through symbolic regression, setting each of the five ENG items as dependent variables (obtained using the whole data set). de Vries NJ, Carlson J, Moscato P (2014) A Data-Driven Approach to Reverse Engineering Customer Engagement Models: Towards Functional Constructs. PLoS ONE 9(7): e102768. doi:10.1371/journal.pone.0102768
  • 30. Figure 3. Data Set A – Network found as a result of the application of the model finding optimization software on each variable as a target. de Vries NJ, Carlson J, Moscato P (2014) A Data-Driven Approach to Reverse Engineering Customer Engagement Models: Towards Functional Constructs. PLoS ONE 9(7): e102768. doi:10.1371/journal.pone.0102768
  • 31. Inter-rater Agreement. de Vries NJ, Carlson J, Moscato P (2014) A Data-Driven Approach to Reverse Engineering Customer Engagement Models: Towards Functional Constructs. PLoS ONE 9(7): e102768. doi:10.1371/journal.pone.0102768
  • 32. Our Future research directions • Work on scalability of methodologies • Improve optimisation algorithms (minimum distance, maximum objectives, etc.) • Meta-heuristics (Memetic Algorithms) for applications in the social sciences • Network alignment (complex network analysis) of consumer behaviour networks for uncovering structure in datasets • Proposal of an edited book on large-scale “Business and Consumer Analytics” (Springer) • Smart Cities Network (sensor data, optimisation of cities and their networks) • Digital Economy technologies
  • 33. UoN and UKM Things to remember: • UoN is always open for research collaborations (depending on funds – we operate on a project basis) • At CIBM we have supercomputing capacity available for large-scale projects • In our team we have particularly strong expertise in operations research and management science • CIBM is open to diversifying into new areas (e.g. computational social science as demonstrated today) • As Prof. Moscato says: “Do not hesitate to throw an ‘odd-ball’. Either we could be interested, or we could put you in touch with other collaborators and colleagues”.
  • 34. Terima Kasih (Thank you). Questions?
  • 35. References • Arefin AS, Mathieson L, Johnstone D, Berretta R, Moscato P (2012) Unveiling Clusters of RNA Transcript Pairs Associated with Markers of Alzheimer’s Disease Progression, PLOS ONE, DOI: 10.1371/journal.pone.0045535 • Capp et al. (2009) Is there more than one proctitis syndrome? A revisitation using data from the TROG 96.01 trial, Radiotherapy and Oncology, 90(3), 400-407 • Hair JF, Hult GTM, Ringle CM, Sarstedt M (2014) A Primer on Partial Least Squares Structural Equation Modeling (PLS-SEM), Los Angeles: Sage Publications Inc. • Inostroza-Ponta M, Mendes A, Berretta R, Moscato P (2007) An Integrated QAP-Based Approach to Visualize Patterns of Gene Expression Similarity, Progress in Artificial Life, Lecture Notes in Computer Science, 4828, pp 156-167 • Inostroza-Ponta M, Berretta R, Moscato P (2011) QAPgrid: A Two Level QAP-Based Approach for Large-Scale Data Analysis and Visualization, PLOS ONE, DOI: 10.1371/journal.pone.0014468 • Lucas B, Arefin AS, de Vries NJ, Berretta R, Carlson J, Moscato P (2014) Engagement in Motion: Exploring Short Term Dynamics in Page-Level Social Media Metrics, IEEE Conference on Social Computing and Big Data and Cloud Computing (Sydney) • de Vries NJ, Carlson J (2014) Examining the drivers and brand performance implications of customer engagement with brands in the social media environment, Journal of Brand Management, 21, 495-515 • de Vries NJ, Carlson J, Moscato P (2014) A Data-Driven Approach to Reverse Engineering Customer Engagement Models: Towards Functional Constructs, PLOS ONE, DOI: 10.1371/journal.pone.0102768 • de Vries NJ, Arefin AS, Moscato P (2014) Gauging Heterogeneity in Online Consumer Behaviour Data: A Proximity Graph Approach, IEEE Conference on Social Computing and Big Data and Cloud Computing (Sydney) • Marsden J, Budden D, Craig H, Moscato P (2013) Language Individuation and Marker Words: Shakespeare and His Maxwell's Demon, PLOS ONE, DOI: 10.1371/journal.pone.0066813 • Naeni LM, de Vries NJ, Reis R, Arefin AS, Berretta R, Moscato P (2014) Identifying Communities of Trust and Confidence in the Charity and Not-for-Profit Sector: A Memetic Algorithm Approach, IEEE Conference on Social Computing and Big Data and Cloud Computing (Sydney) • van Doorn J, Lemon KN, Mittal V, Nass S, Pick D, Pirner P, Verhoef PC (2010) Customer Engagement Behavior: Theoretical Foundations and Research Directions, Journal of Service Research, 13(3), 253-266
  • 37. New Publication. Published 7th April 2015 in PLOS ONE. N J de Vries, R Reis, P Moscato: Clustering of consumers based on trust and donating behaviours in the not-for-profit sector. Including symbolic regression predictive modeling for consumer involvement with charities
  • 39. Resulting Segments of the Australian Market 1. Non-institutionalist charity supporters 2. Resource allocation critics 3. Information-seeking financial sceptics 4. Non-questioning charity supporters 5. Non-trusting sceptics 6. Charity management believers 7. Institutionalist charity believers http://journals.plos.org/plosone/article?id=10.1371%2Fjournal.pone.0122133
  • 40. IEEE Conference paper. Methodology: Product Meta-features. The product of values of two measured features may be able to distinguish between two given categories, even when those features cannot do so alone. This study is the first to trial the application of this idea. Left, the values of f1 (blue) and f2 (red) do not distinguish the classes well but their product (meta-feature in green) does. Data preparation workflow: Data collection and pre-processing; Meta-features: pair-wise differences; Meta-features: pair-wise products; Intra- and inter-construct relationships; Distance computation. [Figure: two panels plotting f1, f2 and the meta-feature (Meta-f) for Class A and Class B.]
  • 41. My publications • A Data-Driven Approach to Reverse Engineering Customer Engagement Models: Towards Functional Constructs (de Vries, Carlson and Moscato) http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0102768 • Examining the drivers and brand performance implications of customer engagement with brands in the social media environment (de Vries and Carlson): http://www.palgrave-journals.com/bm/journal/v21/n6/abs/bm201418a.html • Gauging Heterogeneity in Online Consumer Behaviour Data: A Proximity Graph Approach (de Vries, Arefin and Moscato) http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7034833 • Engagement in Motion: Exploring Short Term Dynamics in Page-Level Social Media Metrics (Lucas et al) http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7034813&tag=1 • Identifying Communities of Trust and Confidence in the Charity and Not-for-Profit Sector: A Memetic Algorithm Approach (Moslemi et al) http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7034835&refinements%3D4251871666%26filter%3DAND%28p_IS_Number%3A7034739%29
  • 42. Other Sources First uses of ‘meta-features’: • Differences in Abundances of Cell-Signalling Proteins in Blood Reveal Novel Biomarkers for Early Detection Of Clinical Alzheimer's Disease (De Paula et al) http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0017481 • Unveiling Clusters of RNA Transcript Pairs Associated with Markers of Alzheimer’s Disease Progression (Arefin et al) http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0045535 MST-kNN papers: • An Integrated QAP-Based Approach to Visualize Patterns of Gene Expression Similarity (Inostroza Ponta et al) http://link.springer.com/chapter/10.1007/978-3-540-76931-6_14 • kNN-MST-Agglomerative: A fast and scalable graph-based data clustering approach on GPU (Arefin et al) http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6295143
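
The MST-kNN steps listed on slide 8 (complete graph, minimum spanning tree, removal of edges that are not k-nearest neighbours, final forest) can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the CIBM implementation: the function names, the Euclidean distance, the fixed k, and the rule of keeping an MST edge when either endpoint counts the other among its k nearest neighbours are all assumptions made here, and the published method may differ (for example in how k is chosen or whether the procedure is applied recursively).

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length tuples (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def minimum_spanning_tree(points):
    """Prim's algorithm on the complete graph; returns edges (i, j) with i < j."""
    n = len(points)
    in_tree = {0}
    edges = set()
    while len(in_tree) < n:
        # Cheapest edge leaving the current tree.
        _, i, j = min(
            (euclidean(points[i], points[j]), i, j)
            for i in in_tree
            for j in range(n)
            if j not in in_tree
        )
        edges.add((min(i, j), max(i, j)))
        in_tree.add(j)
    return edges


def k_nearest(points, k):
    """Map each point index to the set of its k nearest neighbour indices."""
    n = len(points)
    return {
        i: set(
            sorted(
                (j for j in range(n) if j != i),
                key=lambda j: euclidean(points[i], points[j]),
            )[:k]
        )
        for i in range(n)
    }


def mst_knn_clusters(points, k):
    """Keep only MST edges that are also kNN edges; return the resulting forest."""
    mst = minimum_spanning_tree(points)
    knn = k_nearest(points, k)
    # Assumption: an edge survives if either endpoint lists the other as a neighbour.
    kept = [(i, j) for i, j in mst if j in knn[i] or i in knn[j]]

    # Connected components of the surviving forest (union-find).
    parent = list(range(len(points)))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in kept:
        parent[find(i)] = find(j)

    clusters = {}
    for i in range(len(points)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Hypothetical toy data: two well-separated groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
print(mst_knn_clusters(points, k=2))  # [[0, 1, 2], [3, 4, 5]]
```

In the toy data the single long MST edge bridging the two groups is not a 2-nearest-neighbour edge for either endpoint, so removing it splits the tree into two clusters.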

Editor's notes

  1. We have all heard the following “buzzwords”, keywords and topics; this is what ‘traditional’ science and social science have in common nowadays: analysis of large datasets and development of scalable methods.
  2. Note about how computational methods are highly variable (computational linguistics)
  3. Only talk about this briefly and quickly. The only point is to highlight that the results using some sort of meta-feature were more significant
  4. Just talk about general comparison – doing the process with 3 datasets means finding more solid “structure” in the dataset
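
The pair-wise difference and product meta-features described on slides 15 and 40 can be illustrated with a small sketch. The function names and the toy values below are hypothetical, chosen only so that neither f1 nor f2 separates the two classes alone while their difference does; they are not data from the study.

```python
from itertools import combinations


def difference_meta_features(row):
    """Return {(i, j): row[i] - row[j]} for every feature pair with i < j."""
    return {(i, j): row[i] - row[j] for i, j in combinations(range(len(row)), 2)}


def product_meta_features(row):
    """Return {(i, j): row[i] * row[j]} for every feature pair with i < j."""
    return {(i, j): row[i] * row[j] for i, j in combinations(range(len(row)), 2)}


# Hypothetical (f1, f2) samples: the raw values of the two classes interleave.
class_a = [(2, 1), (6, 5), (9, 8)]
class_b = [(1, 3), (5, 7), (8, 10)]

diffs_a = [difference_meta_features(r)[(0, 1)] for r in class_a]
diffs_b = [difference_meta_features(r)[(0, 1)] for r in class_b]

print(diffs_a)  # [1, 1, 1]
print(diffs_b)  # [-2, -2, -2]
```

Here any threshold between -2 and 1 on the difference meta-feature separates the classes, whereas no single threshold on f1 or on f2 does. The product variant is computed the same way and would be fed into the distance computation alongside, or instead of, the original features.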