While data privacy challenges long predate current trends in machine-learning-as-a-service (MLAAS) offerings, predictive APIs do expose significant new attack vectors. To provide users with tailored recommendations, these applications often expose endpoints either to dynamic models or to pre-trained model artifacts, which learn patterns from data to surface insights. Problems arise when training data are collected, stored, and modeled in ways that jeopardize privacy. Even when user data is not exposed directly, private information can often be inferred using a technique called model inversion. In this talk, I discuss current research in black-box model inversion and present a machine learning approach to discovering the model families of deployed black-box models using only their decision topologies. Prior work suggests the efficacy of model-family-specific attack vectors (i.e., once the model is no longer a black box, it is easier to exploit). As such, we tackle only the problem of model discovery, not of model inversion, reasoning that solving model identification clears a path for information security and cryptography experts to apply domain-specific tools to model inversion.
10. Useful for Model Inversion
● Linearity: the more linear the model, the easier it is to perturb (Goodfellow et al. 2015)
● Prediction metadata: confidence scores, class prediction probabilities, or decision functions make inversion easier (Fredrikson et al. 2015)
● Commercial MLAAS: reverse engineering is easier because the model families and hyperparameters used for training are known (Tramèr et al. 2016)
● Deployed black boxes: private training data can be extracted from prediction behavior (Song et al. 2017)
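The linearity point above can be made concrete with a toy sketch (stdlib only, hypothetical weights): for a linear scorer w·x + b, nudging each feature by eps in the direction of sign(w) shifts the score by eps · Σ|wᵢ| — a worst-case change that grows with dimensionality, which is the intuition behind the fast gradient sign method of Goodfellow et al. (2015).

```python
# Toy illustration of why linearity aids perturbation: an FGSM-style
# step against a linear scorer. Weights and inputs are hypothetical.

def linear_score(w, b, x):
    return sum(wi * xi for wi, xi in zip(w, x)) + b

w = [0.5, -1.2, 0.8, 2.0]   # hypothetical fitted weights
b = 0.1
x = [1.0, 0.0, -1.0, 0.5]   # a benign input
eps = 0.25                  # small per-feature perturbation budget

# Step each coordinate in the direction that most increases the score.
x_adv = [xi + eps * (1 if wi > 0 else -1) for xi, wi in zip(x, w)]

shift = linear_score(w, b, x_adv) - linear_score(w, b, x)
print(round(shift, 3))  # 0.25 * (0.5 + 1.2 + 0.8 + 2.0) = 1.125
```

Even though each feature moved by only 0.25, the score moved by 1.125 — the per-feature budget compounds across dimensions.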
11. How much can be determined about a fitted model?
12. Yellowbrick
● Open source Python library that extends the Scikit-Learn API.
● Model (not data) visualization.
● Tools for feature engineering, visual diagnostics, evaluation, and steering.
● Enhances the model selection process.
E.g., ScoreVisualizers to gauge accuracy and diagnose problems like overfitting and heteroskedasticity.
13. How can we anticipate model-specific attack vectors?
14. First, some definitions
“‘Model’ is an overloaded term.” - Hadley Wickham (2015)
● Model family: the high-level relationships between variables of interest.
● Model form: the specific relationships between variables within a model family’s framework.
● Fitted model: a concrete instance of a model form where all parameters have been estimated from data; used to generate predictions.
Do fitted models exhibit distinctive topologies you could use to infer family or form?
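One way to read Wickham's three senses of "model" is with a tiny, hypothetical linear fit (stdlib only; the data and names are mine, not the talk's):

```python
# Model family: y is some linear function of x        ->  y = a*x + b
# Model form:   one predictor plus an intercept, no higher-order terms
# Fitted model: a and b estimated from data, usable for prediction
import statistics

xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]   # noiseless data generated by y = 2x + 1

# Ordinary least squares for the slope and intercept.
mx, my = statistics.fmean(xs), statistics.fmean(ys)
a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
b = my - a * mx

def fitted_model(x):
    """The concrete, prediction-generating instance."""
    return a * x + b

print(a, b, fitted_model(4.0))  # 2.0 1.0 9.0
```

The family and form exist before any data are seen; only the fitted model depends on the training set — which is why it is the fitted model that can leak.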
A bit about me: I’m a data scientist and a generalist, interested in NLP and visual diagnostics.
Data Science is often about consuming data for a purpose it wasn’t originally intended for. This can be tricky because security and privacy are not standard parts of most data science curricula yet.
So when data scientists move beyond purely downstream analytics, gain access to data further up the chain, or start collecting their own data via deployed applications, we can run into problems.
Even though the names and SSNs have been scrubbed, 100% of the 18-year-old Asian Americans are listed as having HIV/AIDS. In communities where the population of Asian Americans is sufficiently small, this is tantamount to directly exposing PII.
I’ve learned a lot as a data scientist from the differential privacy discussion, and from people like Jim Klucar.
Now with the GDPR, more and more app developers are thinking about data security issues.
Strava's online exercise-tracking map unwittingly revealed remote military outposts in Afghanistan, Iraq, Syria, and Djibouti — and even the identities of soldiers based there. (Nov 2017)
But, there is a sense that black box models are relatively secure.
This is part of the promise of Machine Learning as a Service offerings.
So how does MLAAS work? Data is used to train a model, and the model is serialized and hosted as an application artifact together with the other compiled source and executables. Users enter data, which is transformed at the application layer into REST-like calls to the model, which passes back a prediction.
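The flow just described can be sketched in a few lines of stdlib Python — a hypothetical one-feature model, with pickle standing in for the serialization step and a plain function standing in for the REST layer:

```python
# Minimal sketch of the MLAAS flow: a fitted model is serialized as an
# application artifact, and user input becomes a REST-like call that
# returns a prediction. All names and parameters are hypothetical.
import json
import pickle

# "Training" has already happened; ship the fitted parameters.
fitted = {"slope": 2.0, "intercept": 0.5}
artifact = pickle.dumps(fitted)   # serialized model hosted with the app

def predict_endpoint(request_body: str) -> str:
    """Application layer: JSON request in, JSON prediction out."""
    model = pickle.loads(artifact)          # load the hosted artifact
    x = json.loads(request_body)["x"]       # user data from the request
    return json.dumps({"prediction": model["slope"] * x + model["intercept"]})

response = predict_endpoint(json.dumps({"x": 5.0}))
print(response)  # {"prediction": 10.5}
```

Note that the user never sees the training data or the parameters — only predictions — which is what gives the black box its feeling of safety.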
But, given enough API calls, this deployed black box could expose more than just predictions.
Each prediction generates a kind of new training vector -> (input data, ŷ)
We could exploit this.
Given some parts of other users’ data, we might be able to reverse engineer the rest.
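Here is the exploit in miniature (stdlib only; a hypothetical 2-feature linear endpoint, not a real service): each API call is a free (input, ŷ) pair, and for a linear model, d + 1 well-chosen queries solve for the parameters exactly — the equation-solving idea that Tramèr et al. (2016) generalize to real MLAAS models.

```python
# Model extraction sketch: recover a hidden linear model's parameters
# from prediction queries alone. SECRET_* live "inside" the black box.

SECRET_W = [1.5, -2.0]
SECRET_B = 0.75

def black_box(x):
    """The only access an attacker has: query in, prediction out."""
    return SECRET_W[0] * x[0] + SECRET_W[1] * x[1] + SECRET_B

# d + 1 = 3 probe queries suffice to solve for w1, w2, b exactly.
b_hat = black_box([0.0, 0.0])             # the intercept falls out directly
w1_hat = black_box([1.0, 0.0]) - b_hat    # unit probe on feature 1
w2_hat = black_box([0.0, 1.0]) - b_hat    # unit probe on feature 2

print(w1_hat, w2_hat, b_hat)  # 1.5 -2.0 0.75 -- the "secret" is out
```

Real endpoints rate-limit and add noise, but the principle stands: predictions are a side channel onto the fitted parameters.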
Research is increasingly finding evidence of the vulnerabilities of black box models.
As I’ve said, I’m no security researcher, but I do think a lot about what we can determine about fitted models.
Yellowbrick is an open source Python library I started building with my colleague Benjamin Bengfort about 4 years ago.
Yellowbrick is for…
Data scientists to evaluate the stability and predictive value of their models.
Data engineers to monitor model performance in real world applications.
Users of models to interpret model behavior in high dimensional space.
Students to understand a large variety of algorithms and methods.
Information security specialists…?
Could visual diagnostics be used to identify model-specific attack vectors?
A visual signature?
RBF kernels give models a distinct signature
Could we use these signatures to steer strategic perturbations in our models before we deploy them?
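One crude, hypothetical version of such a fingerprint can be probed without any visualization at all (stdlib only; both "black boxes" are stand-ins I defined for illustration): a linear model's score at the midpoint of two inputs equals the midpoint of its scores, while an RBF-style model's generally does not — curvature in the decision topology betrays the family.

```python
# Probe two black-box scorers along a segment and test for linearity,
# a toy "decision topology" fingerprint of the kind the talk proposes.
import math

def linear_box(x):
    return 2.0 * x[0] - 1.0 * x[1] + 0.5

def rbf_box(x):
    # RBF-kernel-style score around a single "support vector" at (1, 1)
    return math.exp(-2.0 * ((x[0] - 1.0) ** 2 + (x[1] - 1.0) ** 2))

def midpoint_gap(box, a, b):
    """0 for any linear scorer; nonzero where the surface curves."""
    mid = [(ai + bi) / 2 for ai, bi in zip(a, b)]
    return abs(box(mid) - (box(a) + box(b)) / 2)

a, b = [0.0, 0.0], [2.0, 2.0]
print(midpoint_gap(linear_box, a, b))  # ~0: consistent with a linear family
print(midpoint_gap(rbf_box, a, b))     # large: curvature suggests a kernel method
```

Repeating the probe over many segments would turn this into a feature vector for a classifier over model families — the model-discovery approach the talk sketches.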