SlideShare una empresa de Scribd logo
1 de 25
Descargar para leer sin conexión
Multi-task Distributed Learning using
Vision Transformer with
Random Patch Permutation
Sangjoon Parka, Jong Chul Yea,b
BISPL - BioImaging, Signal Processing, and Learning Lab.
aDept. of Bio and Brain Engineering
bKim Jaechul Graduate School of AI
KAIST, Korea
Background
• Artificial intelligence has been gaining unprecedent popularity,
including medical imaging.
• To enable the AI models to offer precise decision support with
robustness, an enormous amount of data are indispensable.
• However, data collected from volunteer participation of a few
institutions cannot fully meet the amount to guarantee robustness.
Background
• Especially for the newly emerging disease like COVID-19, the
limitation can be exacerbated as it is hard to build a large, well-
curated dataset promptly.
• The ability to collaborate between institutions is critical for the
successful application of AI in medical imaging, but the rigorous
regulations and the ethical restrictions is an another obstacle.
• United States Health Insurance Portability and Accountability Act (HIPAA)
• European General Data Protection Regulation (GDPR)
Background
• Accordingly, distributed learning methods, which perform
learning tasks at edge devices in a distributed fashion, can be
effectively utilized in healthcare research.
• Federated learning (FL), Split learning (SL), Swarm learning, etc..
• However, there are still many limitations with the existing
distributed learning methods, which hinder widespread adoption.
• FL: dependency on client-side computation, gradient inversion attack
• SL: high communication overhead, feature hijacking, slower convergence
Distributed learning methods
Park et al. NeurIPS 2021
Existing distributed learning methods
Vision Transformer (ViT)
We focused on the model configuration and intrinsic properties of Vision
Transformer (ViT), suitable for multi-task distributed learning.
Dosovitsky et al. ICLR 2021
Federated Split Task-Agnostic (FeSTA) learning
Park et al. NeurIPS 2021
Federated Split Task-Agnostic (FeSTA) learning
Park et al. NeurIPS 2021
Limitations
Problem with FeSTA learning
1. Huge communication overheads.
• Even larger than FL or SL.
2. Marginal improvement in performance with MTL.
3. No method for privacy-preservation.
• Privacy threatening the same as the FL or SL.
Purpose
Here, we aim to develop a new algorithm dubbed “Federated Split
Task-Agnostic learning with permutating pure ViT (p-FeSTA)”, which
alleviate these drawbacks.
To this end, we adopted the pure ViT architecture and patch
permutation module, which enforce model to benefit more from MTL
and to offer better privacy.
Permutation invariance of self-attention mechanism
=
Self-attention mechanism is essentially “permutation invariant”.
Permutation invariance of ViT
Naser et al. NeurIPS 2021
Different from CNN, ViT is essentially patch-permutation invariant.
We hypothesized that the property of permutation invariance can be utilized for privacy-
preservation.
Model configuration of the proposed p-FeSTA
Learning process of the proposed p-FeSTA
Datasets for multi-task learning
We simulated a multi-task learning with three tasks: classification, severity prediction
(regression), and segmentation with 6 clients.
Class
CNUH
(Test)
KNUH
(Client #1)
BIMCV
(Client #2)
Normal 417 400 93
Other
infection
58 400 -
COVID-
19
81 293 782
Total 556 1,093 875
Severity
CNUH
(Test)
YNU
(Client #3)
Brixia
(Client #4)
1 26 63 261
2 11 59 443
3 8 25 414
4 7 35 866
5 12 18 745
6 17 86 1,536
Total 81 286 4,265
Data
Subset 1
(Test)
Subset 2
(Client #5)
Subset 3
(Client #6)
Total 1,000 4,840 4,839
Classification data
Severity data
Segmentation data
Performance comparison
Methods
Classification
(AUC)
Severity
(MSE)
Segmentation
(Dice)
Average Normal Others COVID-19
Data centralized 0.671 (0.051) 0.735 (0.071) 0.777 (0.045) 0.500 (0.051) 1.592 (0.081) 0.793 (0.005)
Federated learning 0.601 (0.036) 0.597 (0.146) 0.483 (0.068) 0.722 (0.023) 2.159 (0.188) 0.789 (0.001)
Split learning 0.546 (0.024) 0.522 (0.067) 0.534 (0.050) 0.583 (0.013) 2.546 (0.414) 0.790 (0.000)
FeSTA (STL) 0.718 (0.047) 0.680 (0.088) 0.677 (0.032) 0.795 (0.036) 1.318 (0.125) 0.801 (0.011)
p-FeSTA (STL) 0.696 (0.022) 0.739 (0.093) 0.557 (0.118) 0.790 (0.045) 1.848 (0.080) 0.803 (0.004)
FeSTA (MTL) 0.780 (0.019) 0.785 (0.009) 0.793 (0.100) 0.761 (0.034) 1.416 (0.048) 0.796 (0.013)
p-FeSTA (MTL) 0.884 (0.008) 0.906 (0.004) 0.890 (0.011) 0.857 (0.014) 1.361 (0.057) 0.808 (0.003)
Values are presented in mean (standard deviation) of three repeats with different seed.
Significantly reduced communication costs
Communication overheads
Where R = total rounds, P = number of parameters, n = avg. rounds, B = batch size,
F = features, G = gradients, D = total number of data
Significantly reduced communication costs
Total Features/gradients Parameters
Classification
FL 10456.152M - 10456.152M
SL 9474.048M 9474.048M -
FeSTA 11390.423M 9474.048M 1916.375M
p-FeSTA 4880.648M (42.8%) 4844.890M 35.758M
Severity
FL 11090.794M - 11090.794M
SL 9474.048M 9474.048M -
FeSTA 12025.065M 9474.048M 2551.017M
p-FeSTA 5435.649M (45.2%) 4765.249M 670.401M
Segmentation
FL 11160.985M - 11160.985M
SL 9474.048M 9474.048M -
FeSTA 12095.256M 9474.048M 2621.208M
p-FeSTA 5899.113M (48.8%) 5158.520M 740.592M
Privacy preservation with permutation module
Privacy preservation with permutation module
Permutation module makes it “underdetermined problem”.
Ablation studies
Methods
Classification
(AUC)
Severity
(MSE)
Segmentation
(Dice)
Average Normal Others COVID-19
Proposed
0.884
(0.008)
0.906
(0.004)
0.890
(0.011)
0.857
(0.014)
1.361
(0.057)
0.808 (0.003)
w learnable head
0.890
(0.001)
0.909
(0.014)
0.895
(0.005)
0.866
(0.013)
1.545
(0.386)
0.789 (0.000)
w/o permutation
0.890
(0.010)
0.909
(0.002)
0.904
(0.023)
0.858
(0.008)
1.461
(0.064)
0.809 (0.002)
w/o positional encoding
0.827
(0.028)
0.831
(0.035)
0.786
(0.049)
0.862
(0.007)
1.942
(0.112)
0.798 (0.004)
Values are presented in mean (standard deviation) of three repeats with different seed.
Comparison of the distributed learning methods
FL SL FeSTA p-FeSTA
Model averaging O X O O
Client-side learning Parallel Sequential Parallel Parallel
Model split X O O O
Communication cost High High High Low
Benefit from MTL X X Small Large
Privacy preservation X X X O
Conclusion & Summary
• We proposed the novel p-FESTA framework with pure ViT, which
elicits the synergy of MTL as well as reduces the communication
overhead significantly compared to the existing methods.
• In addition, we also enhanced the privacy using the Permutation
module in a way specific to ViT.
• We believe that our work is a step toward facilitating distributed
learning among the institutions wanting to participate in different
tasks.
Thank you for attention!
Q & A

Más contenido relacionado

Similar a 20220517_KoSAIM_sjp_v1.2.pdf

FACE RECOGNITION USING ELM-LRF
FACE RECOGNITION USING ELM-LRFFACE RECOGNITION USING ELM-LRF
FACE RECOGNITION USING ELM-LRFAras Masood
 
Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...BaoTramDuong2
 
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...ijtsrd
 
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...ijsc
 
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...ijsc
 
Disease Identification and Detection in Apple Tree
Disease Identification and Detection in Apple TreeDisease Identification and Detection in Apple Tree
Disease Identification and Detection in Apple Treeijtsrd
 
Neural networks for the prediction and forecasting of water resources variables
Neural networks for the prediction and forecasting of water resources variablesNeural networks for the prediction and forecasting of water resources variables
Neural networks for the prediction and forecasting of water resources variablesJonathan D'Cruz
 
SURVEY PAPER ON OUT LIER DETECTION USING FUZZY LOGIC BASED METHOD
SURVEY PAPER ON OUT LIER DETECTION USING FUZZY LOGIC BASED METHODSURVEY PAPER ON OUT LIER DETECTION USING FUZZY LOGIC BASED METHOD
SURVEY PAPER ON OUT LIER DETECTION USING FUZZY LOGIC BASED METHODIJCI JOURNAL
 
IRJET- A Privacy Leakage Upper Bound Constraint-Based Approach for Cost-E...
IRJET-  	  A Privacy Leakage Upper Bound Constraint-Based Approach for Cost-E...IRJET-  	  A Privacy Leakage Upper Bound Constraint-Based Approach for Cost-E...
IRJET- A Privacy Leakage Upper Bound Constraint-Based Approach for Cost-E...IRJET Journal
 
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNINGSEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNINGgerogepatton
 
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNINGSEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNINGgerogepatton
 
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNINGSEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNINGijaia
 
Ccids 2019 cutting edges of ai technology in medicine
Ccids 2019 cutting edges of ai technology in medicineCcids 2019 cutting edges of ai technology in medicine
Ccids 2019 cutting edges of ai technology in medicineNamkug Kim
 
IRJET - Survey on Analysis of Breast Cancer Prediction
IRJET - Survey on Analysis of Breast Cancer PredictionIRJET - Survey on Analysis of Breast Cancer Prediction
IRJET - Survey on Analysis of Breast Cancer PredictionIRJET Journal
 
A discriminative-feature-space-for-detecting-and-recognizing-pathologies-of-t...
A discriminative-feature-space-for-detecting-and-recognizing-pathologies-of-t...A discriminative-feature-space-for-detecting-and-recognizing-pathologies-of-t...
A discriminative-feature-space-for-detecting-and-recognizing-pathologies-of-t...Damian R. Mingle, MBA
 
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...cscpconf
 
A MACHINE LEARNING METHODOLOGY FOR DIAGNOSING CHRONIC KIDNEY DISEASE
A MACHINE LEARNING METHODOLOGY FOR DIAGNOSING CHRONIC KIDNEY DISEASEA MACHINE LEARNING METHODOLOGY FOR DIAGNOSING CHRONIC KIDNEY DISEASE
A MACHINE LEARNING METHODOLOGY FOR DIAGNOSING CHRONIC KIDNEY DISEASEIRJET Journal
 
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...ahmad abdelhafeez
 

Similar a 20220517_KoSAIM_sjp_v1.2.pdf (20)

FACE RECOGNITION USING ELM-LRF
FACE RECOGNITION USING ELM-LRFFACE RECOGNITION USING ELM-LRF
FACE RECOGNITION USING ELM-LRF
 
Madhavi
MadhaviMadhavi
Madhavi
 
Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...Discover How Scientific Data is Used for the Public Good with Natural Languag...
Discover How Scientific Data is Used for the Public Good with Natural Languag...
 
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
Adaptive Classification of Imbalanced Data using ANN with Particle of Swarm O...
 
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
AN EFFICIENT PSO BASED ENSEMBLE CLASSIFICATION MODEL ON HIGH DIMENSIONAL DATA...
 
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
An Efficient PSO Based Ensemble Classification Model on High Dimensional Data...
 
Disease Identification and Detection in Apple Tree
Disease Identification and Detection in Apple TreeDisease Identification and Detection in Apple Tree
Disease Identification and Detection in Apple Tree
 
Neural networks for the prediction and forecasting of water resources variables
Neural networks for the prediction and forecasting of water resources variablesNeural networks for the prediction and forecasting of water resources variables
Neural networks for the prediction and forecasting of water resources variables
 
SURVEY PAPER ON OUT LIER DETECTION USING FUZZY LOGIC BASED METHOD
SURVEY PAPER ON OUT LIER DETECTION USING FUZZY LOGIC BASED METHODSURVEY PAPER ON OUT LIER DETECTION USING FUZZY LOGIC BASED METHOD
SURVEY PAPER ON OUT LIER DETECTION USING FUZZY LOGIC BASED METHOD
 
IRJET- A Privacy Leakage Upper Bound Constraint-Based Approach for Cost-E...
IRJET-  	  A Privacy Leakage Upper Bound Constraint-Based Approach for Cost-E...IRJET-  	  A Privacy Leakage Upper Bound Constraint-Based Approach for Cost-E...
IRJET- A Privacy Leakage Upper Bound Constraint-Based Approach for Cost-E...
 
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNINGSEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
 
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNINGSEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
 
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNINGSEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
SEGMENTATION OF THE GASTROINTESTINAL TRACT MRI USING DEEP LEARNING
 
Ccids 2019 cutting edges of ai technology in medicine
Ccids 2019 cutting edges of ai technology in medicineCcids 2019 cutting edges of ai technology in medicine
Ccids 2019 cutting edges of ai technology in medicine
 
IRJET - Survey on Analysis of Breast Cancer Prediction
IRJET - Survey on Analysis of Breast Cancer PredictionIRJET - Survey on Analysis of Breast Cancer Prediction
IRJET - Survey on Analysis of Breast Cancer Prediction
 
A discriminative-feature-space-for-detecting-and-recognizing-pathologies-of-t...
A discriminative-feature-space-for-detecting-and-recognizing-pathologies-of-t...A discriminative-feature-space-for-detecting-and-recognizing-pathologies-of-t...
A discriminative-feature-space-for-detecting-and-recognizing-pathologies-of-t...
 
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
PREDICTION OF MALIGNANCY IN SUSPECTED THYROID TUMOUR PATIENTS BY THREE DIFFER...
 
[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar
[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar
[IJET-V2I3P21] Authors: Amit Kumar Dewangan, Akhilesh Kumar Shrivas, Prem Kumar
 
A MACHINE LEARNING METHODOLOGY FOR DIAGNOSING CHRONIC KIDNEY DISEASE
A MACHINE LEARNING METHODOLOGY FOR DIAGNOSING CHRONIC KIDNEY DISEASEA MACHINE LEARNING METHODOLOGY FOR DIAGNOSING CHRONIC KIDNEY DISEASE
A MACHINE LEARNING METHODOLOGY FOR DIAGNOSING CHRONIC KIDNEY DISEASE
 
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
Robust Breast Cancer Diagnosis on Four Different Datasets Using Multi-Classif...
 

Último

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 

Último (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

20220517_KoSAIM_sjp_v1.2.pdf

  • 1. Multi-task Distributed Learning using Vision Transformer with Random Patch Permutation Sangjoon Parka, Jong Chul Yea,b BISPL - BioImaging, Signal Processing, and Learning Lab. aDept. of Bio and Brain Engineering bKim Jaechul Graduate School of AI KAIST, Korea
  • 2. Background • Artificial intelligence has been gaining unprecedent popularity, including medical imaging. • To enable the AI models to offer precise decision support with robustness, an enormous amount of data are indispensable. • However, data collected from volunteer participation of a few institutions cannot fully meet the amount to guarantee robustness.
  • 3. Background • Especially for the newly emerging disease like COVID-19, the limitation can be exacerbated as it is hard to build a large, well- curated dataset promptly. • The ability to collaborate between institutions is critical for the successful application of AI in medical imaging, but the rigorous regulations and the ethical restrictions is an another obstacle. • United States Health Insurance Portability and Accountability Act (HIPAA) • European General Data Protection Regulation (GDPR)
  • 4. Background • Accordingly, distributed learning methods, which perform learning tasks at edge devices in a distributed fashion, can be effectively utilized in healthcare research. • Federated learning (FL), Split learning (SL), Swarm learning, etc.. • However, there are still many limitations with the existing distributed learning methods, which hinder widespread adoption. • FL: dependency on client-side computation, gradient inversion attack • SL: high communication overhead, feature hijacking, slower convergence
  • 5. Distributed learning methods Park et al. NeurIPS 2021 Existing distributed learning methods
  • 6. Vision Transformer (ViT) We focused on the model configuration and intrinsic properties of Vision Transformer (ViT), suitable for multi-task distributed learning. Dosovitsky et al. ICLR 2021
  • 7. Federated Split Task-Agnostic (FeSTA) learning Park et al. NeurIPS 2021
  • 8. Federated Split Task-Agnostic (FeSTA) learning Park et al. NeurIPS 2021
  • 9. Limitations Problem with FeSTA learning 1. Huge communication overheads. • Even larger than FL or SL. 2. Marginal improvement in performance with MTL. 3. No method for privacy-preservation. • Privacy threatening the same as the FL or SL.
  • 10. Purpose Here, we aim to develop a new algorithm dubbed “Federated Split Task-Agnostic learning with permutating pure ViT (p-FeSTA)”, which alleviate these drawbacks. To this end, we adopted the pure ViT architecture and patch permutation module, which enforce model to benefit more from MTL and to offer better privacy.
  • 11. Permutation invariance of self-attention mechanism = Self-attention mechanism is essentially “permutation invariant”.
  • 12. Permutation invariance of ViT Naser et al. NeurIPS 2021 Different from CNN, ViT is essentially patch-permutation invariant. We hypothesized that the property of permutation invariance can be utilized for privacy- preservation.
  • 13. Model configuration of the proposed p-FeSTA
  • 14. Learning process of the proposed p-FeSTA
  • 15. Datasets for multi-task learning We simulated a multi-task learning with three tasks: classification, severity prediction (regression), and segmentation with 6 clients. Class CNUH (Test) KNUH (Client #1) BIMCV (Client #2) Normal 417 400 93 Other infection 58 400 - COVID- 19 81 293 782 Total 556 1,093 875 Severity CNUH (Test) YNU (Client #3) Brixia (Client #4) 1 26 63 261 2 11 59 443 3 8 25 414 4 7 35 866 5 12 18 745 6 17 86 1,536 Total 81 286 4,265 Data Subset 1 (Test) Subset 2 (Client #5) Subset 3 (Client #6) Total 1,000 4,840 4,839 Classification data Severity data Segmentation data
  • 16. Performance comparison Methods Classification (AUC) Severity (MSE) Segmentation (Dice) Average Normal Others COVID-19 Data centralized 0.671 (0.051) 0.735 (0.071) 0.777 (0.045) 0.500 (0.051) 1.592 (0.081) 0.793 (0.005) Federated learning 0.601 (0.036) 0.597 (0.146) 0.483 (0.068) 0.722 (0.023) 2.159 (0.188) 0.789 (0.001) Split learning 0.546 (0.024) 0.522 (0.067) 0.534 (0.050) 0.583 (0.013) 2.546 (0.414) 0.790 (0.000) FeSTA (STL) 0.718 (0.047) 0.680 (0.088) 0.677 (0.032) 0.795 (0.036) 1.318 (0.125) 0.801 (0.011) p-FeSTA (STL) 0.696 (0.022) 0.739 (0.093) 0.557 (0.118) 0.790 (0.045) 1.848 (0.080) 0.803 (0.004) FeSTA (MTL) 0.780 (0.019) 0.785 (0.009) 0.793 (0.100) 0.761 (0.034) 1.416 (0.048) 0.796 (0.013) p-FeSTA (MTL) 0.884 (0.008) 0.906 (0.004) 0.890 (0.011) 0.857 (0.014) 1.361 (0.057) 0.808 (0.003) Values are presented in mean (standard deviation) of three repeats with different seed.
  • 17. Significantly reduced communication costs Communication overheads Where R = total rounds, P = number of parameters, n = avg. rounds, B = batch size, F = features, G = gradients, D = total number of data
  • 18. Significantly reduced communication costs Total Features/gradients Parameters Classification FL 10456.152M - 10456.152M SL 9474.048M 9474.048M - FeSTA 11390.423M 9474.048M 1916.375M p-FeSTA 4880.648M (42.8%) 4844.890M 35.758M Severity FL 11090.794M - 11090.794M SL 9474.048M 9474.048M - FeSTA 12025.065M 9474.048M 2551.017M p-FeSTA 5435.649M (45.2%) 4765.249M 670.401M Segmentation FL 11160.985M - 11160.985M SL 9474.048M 9474.048M - FeSTA 12095.256M 9474.048M 2621.208M p-FeSTA 5899.113M (48.8%) 5158.520M 740.592M
  • 19. Privacy preservation with permutation module
  • 20. Privacy preservation with permutation module Permutation module makes it “underdetermined problem”.
  • 21. Ablation studies Methods Classification (AUC) Severity (MSE) Segmentation (Dice) Average Normal Others COVID-19 Proposed 0.884 (0.008) 0.906 (0.004) 0.890 (0.011) 0.857 (0.014) 1.361 (0.057) 0.808 (0.003) w learnable head 0.890 (0.001) 0.909 (0.014) 0.895 (0.005) 0.866 (0.013) 1.545 (0.386) 0.789 (0.000) w/o permutation 0.890 (0.010) 0.909 (0.002) 0.904 (0.023) 0.858 (0.008) 1.461 (0.064) 0.809 (0.002) w/o positional encoding 0.827 (0.028) 0.831 (0.035) 0.786 (0.049) 0.862 (0.007) 1.942 (0.112) 0.798 (0.004) Values are presented in mean (standard deviation) of three repeats with different seed.
  • 22. Comparison of the distributed learning methods FL SL FeSTA p-FeSTA Model averaging O X O O Client-side learning Parallel Sequential Parallel Parallel Model split X O O O Communication cost High High High Low Benefit from MTL X X Small Large Privacy preservation X X X O
  • 23. Conclusion & Summary • We proposed the novel p-FESTA framework with pure ViT, which elicits the synergy of MTL as well as reduces the communication overhead significantly compared to the existing methods. • In addition, we also enhanced the privacy using the Permutation module in a way specific to ViT. • We believe that our work is a step toward facilitating distributed learning among the institutions wanting to participate in different tasks.
  • 24. Thank you for attention!
  • 25. Q & A