
Module 5

Ethics of data science


An actuary

● An actuary is a professional who uses mathematics, statistics, and financial theory to assess and manage risk and uncertainty.
● They help businesses and clients develop policies to
minimize the financial costs of potential events.
Responsibilities of actuaries around data science and
AI

Actuaries play a critical role in integrating data science and AI into their
work, enhancing their ability to analyze risks and predict future outcomes.
Their responsibilities in this area include:
1. Data Management and Integrity: Actuaries ensure the accuracy,
quality, and ethical use of large datasets, crucial for building reliable
predictive models. They clean and prepare data for analysis, ensuring
it meets high standards.
2. AI and Machine Learning Model Development: Actuaries
develop and apply AI and machine learning models to predict
risks, price insurance products, and optimize financial strategies.
They ensure that these models are transparent, explainable,
and aligned with actuarial principles.
3. Risk Evaluation and Mitigation: They assess the risks
introduced by AI, such as algorithmic bias or cybersecurity
vulnerabilities. Actuaries are responsible for ensuring fairness in
the use of AI and mitigating potential negative impacts.
4. Ethical and Regulatory Compliance: Actuaries ensure
that AI and data science practices comply with industry
regulations and ethical standards, preventing discriminatory
outcomes in areas like insurance underwriting.
5. Automation and Process Optimization: They use AI to
automate repetitive tasks, such as claims processing or data
analysis, improving efficiency and allowing actuaries to focus
on more strategic decision-making.
Data science ethics

The ethics of data science revolve around the moral principles and
guidelines that govern the responsible use of data in research, analysis,
and application.

1. Privacy and Consent


● Data Privacy: Ensuring that personal data is protected and that
sensitive information is not misused.
● Informed Consent: Individuals should be informed about how their
data will be used and must provide consent.
● Anonymization: Data should be anonymized or de-identified to
protect individual identities when analyzing personal information.
2. Bias and Fairness
● Bias in Data: Data sets may have inherent biases that can lead to unfair
outcomes, especially in predictive models. Ensuring data is
representative and fair is essential.
● Algorithmic Fairness: Algorithms should not disproportionately
disadvantage or discriminate against particular groups based on race,
gender, socioeconomic status, or other factors.
● Equity in Outcomes: Consideration should be given to ensuring equitable outcomes for all groups, especially when algorithms are used in high-stakes areas like healthcare or criminal justice (a minimal fairness check is sketched below).
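
To make the algorithmic-fairness point above concrete, here is a minimal Python sketch of a demographic-parity check on a model's decisions. The toy data, column names, and the 80% rule-of-thumb threshold are illustrative assumptions, not part of the original material.

```python
import pandas as pd

# Hypothetical scored dataset: one row per person, with a protected
# attribute ("group") and the model's binary decision ("prediction").
scores = pd.DataFrame({
    "group":      ["A", "A", "A", "A", "B", "B", "B", "B"],
    "prediction": [1,   0,   1,   1,   0,   0,   1,   0],
})

# Demographic parity: the rate of positive decisions per group.
rates = scores.groupby("group")["prediction"].mean()
print(rates)  # group A: 0.75, group B: 0.25

# A common rule of thumb (the "80% rule"): flag the model if the
# worst-treated group's rate is below 80% of the best-treated group's.
disparate_impact = rates.min() / rates.max()
if disparate_impact < 0.8:
    print(f"Potential disparate impact: ratio = {disparate_impact:.2f}")
```

Checks like this belong in model validation alongside accuracy metrics, so unfair treatment is caught before deployment.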
3. Transparency and Explainability
● Transparency: Data scientists must be transparent about the
methods and data used, especially when their work affects public
policy or individuals' lives.
● Explainability: Models, especially complex ones like deep learning, can be opaque ("black-box" models). It is important to make these models interpretable or explainable so that their decisions can be understood by non-experts (see the sketch below).
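
As one illustration of explainability, the sketch below fits an inherently interpretable model (logistic regression) on a public scikit-learn dataset and lists the most influential features, so a non-expert can see what drives its predictions. The dataset and model choice are assumptions made for this example; genuinely black-box models would need dedicated explanation techniques instead.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Fit an inherently interpretable model on a public dataset.
X, y = load_breast_cancer(return_X_y=True, as_frame=True)
model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
model.fit(X, y)

# Each coefficient shows how strongly a measurement pushes a prediction
# towards the positive class, in a form that can be explained to non-experts.
coefs = model.named_steps["logisticregression"].coef_[0]
top = sorted(zip(X.columns, coefs), key=lambda pair: -abs(pair[1]))[:5]
for name, weight in top:
    print(f"{name:>25s}: {weight:+.2f}")
```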
4. Accountability
● Responsibility: Data scientists and organizations must take
responsibility for the outcomes of their models, especially when
errors or biases occur.
● Governance and Regulation: There should be appropriate
oversight, laws, and policies to ensure that data science is applied
in a way that aligns with ethical standards.
5. Impact on Society
● Social Good: Data science should be used to promote social
good and minimize harm. Professionals must consider the
broader impact of their work on society.
● Autonomy and Human Rights: Data science should respect
human rights and personal autonomy, avoiding manipulative
practices such as "surveillance capitalism" or unjust data-based
decision-making.
6. Data Ownership and Usage
● Who Owns Data?: Questions about data ownership,
particularly for large platforms that collect massive amounts of
user data, need ethical considerations.
● Usage Boundaries: Even if data is legally obtained, its usage
might have ethical implications. For example, predictive policing
based on historical data may perpetuate existing injustices.
Owners of the data
● The owners of the data refer to individuals,
organizations, or entities that have legal rights and
control over the data.
● Data ownership determines who has the authority to
decide how the data is accessed, used, shared, and
monetized.
1. Individuals (Data Subjects):
Individuals are often considered the rightful owners of their personal
data. For example, personal information such as names, addresses,
medical records, and browsing behavior is directly linked to
individuals.
Example: If a person uses a fitness tracker, the data collected from
the device (heart rate, activity level, etc.) belongs to the individual, but
the company may also have rights to process and use it depending on
the terms agreed to.
2. Organizations that Collect the Data (Data Controllers):
Companies and organizations that collect and store data are often
considered data controllers. While they may not “own” the personal
data in the strictest sense, they control how the data is processed and
used.
Example: Social media platforms like Facebook or Twitter collect vast
amounts of user data, which they control and often monetize. Even
though the data comes from individuals, the platform has the right to
use the data based on the user agreements signed during account
creation.
3. Data Processors (Third-Party Organizations):
● Data processors are entities that process data on behalf of the data
controller. While they may not "own" the data, they have access to it
and are bound by the agreements they have with the data controller.
They must adhere to privacy regulations and terms of use outlined by
the data controller.
● Example: A cloud service provider storing customer data for a bank
would be a data processor. They do not own the customer data but
are responsible for safeguarding it and following the bank's
instructions regarding its use.
4. Government Bodies:
● Governments may claim ownership of certain types of data,
particularly public data such as demographic information or data
collected for regulatory purposes (e.g., tax records). Governments
also have rights over data that they collect for national security or
public health.
● Example: A government agency like the U.S. Census Bureau owns
the national population data collected during the census, and this data
is used for policy-making and resource allocation.
5. Data Brokers and Aggregators:
● Data brokers are companies that collect data from various
sources, aggregate it, and sell or license it to other companies.
While they may not technically "own" the original raw data, they
often gain ownership of the compiled datasets they create by
adding value (e.g., cleaning, organizing, or analyzing data).
● Example: CIBIL, one of India's major credit information companies, collects and aggregates data on individuals' credit histories from banks and financial institutions. The data is sold to banks, insurance companies, and lenders to assess an individual's creditworthiness, which helps them make decisions on loans and credit approvals.
Valuing Different Aspects of Privacy in Data Science

Data Privacy
Data privacy refers to the right of individuals to control how
their personal information is collected, used, and shared by
others.
This can include a wide range of information, from basic
demographic data such as age and gender to more sensitive
data such as medical history or financial information.
Laws that govern data privacy
1. General Data Protection Regulation (GDPR):
The GDPR is a European Union (EU) regulation that came into effect in 2018 and applies to all EU member states. The GDPR aims to protect individuals' personal data by regulating its processing and transfer, giving individuals more control over their data, and establishing penalties for non-compliance.
2. California Consumer Privacy Act (CCPA):
The CCPA is a law enacted in California, USA, in 2018 that gives California
residents more control over their personal data. The CCPA requires
organizations to disclose the types of personal data they collect, how it is used,
and to whom it is sold. The CCPA also gives individuals the right to request
access to their personal data, have it deleted, and opt out of its sale.
India’s data privacy laws

1. India Digital Personal Data Protection Act (DPDPA) of 2023

The Act was enacted in August 2023 and applies to organizations that process the personal data of individuals in India.
The law requires companies to get users' consent before processing their
personal data, and gives users the right to withdraw consent at any time. The
law also establishes the Data Protection Board (DPB) to enforce the law and
ensure companies comply with data protection regulations.
2. Information Technology Act of 2000
This act and its subsequent revisions recognize the need for
data privacy protection in India. The act includes provisions
that cover some data collection, usage, retention, and
disclosure issues.
The Supreme Court of India has also recognized the right
to privacy as a fundamental right under Article 21 of the
Constitution of India.
How is data privacy managed and preserved in data science?
1. Anonymization and pseudonymization:
● Anonymization involves removing personally identifiable information from data, so it cannot be linked back to an individual.
● Pseudonymization involves replacing personally identifiable information with a pseudonym or code, so the data is not directly identifiable but can be linked back to an individual if necessary (a minimal sketch follows below).
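
A minimal pseudonymization sketch in Python, assuming a pandas DataFrame with an email identifier and a secret key managed separately from the data (both are illustrative): the identifier is replaced with a keyed hash, so only a key-holder can re-link records to individuals.

```python
import hashlib
import hmac

import pandas as pd

# Hypothetical raw records containing a direct identifier (email).
df = pd.DataFrame({
    "email":      ["alice@example.com", "bob@example.com"],
    "heart_rate": [72, 88],
})

# Assumed to be stored and rotated separately from the data itself.
SECRET_KEY = b"keep-this-key-out-of-the-dataset"

def pseudonymize(value: str) -> str:
    """Replace an identifier with a keyed hash (HMAC-SHA256); only a
    key-holder can reproduce the mapping back to the individual."""
    return hmac.new(SECRET_KEY, value.encode(), hashlib.sha256).hexdigest()

df["subject_id"] = df["email"].map(pseudonymize)
df = df.drop(columns=["email"])  # the direct identifier is not retained
print(df)
```

Full anonymization would go further, for example by also generalizing or suppressing quasi-identifiers so records cannot be re-identified even indirectly.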
2. Data minimization:
● Data minimization involves collecting and storing only the minimum amount of data necessary to achieve a specific purpose.
● This can help to reduce the amount of personal data being collected and limit the risk of data breaches or privacy violations.
● Data scientists should also consider the specific types of data they are collecting and ensure that sensitive or confidential information is not being collected unnecessarily (see the sketch below).
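
A small Python sketch of data minimization with pandas: only the columns needed for the stated purpose survive ingestion. The column names and the modelling purpose are assumed for illustration.

```python
import pandas as pd

# A hypothetical ingestion step: the source exposes many fields,
# but only the ones needed for the stated purpose are ever kept.
raw = pd.DataFrame({
    "name":            ["Alice", "Bob"],   # not needed for the model
    "home_address":    ["...", "..."],     # not needed for the model
    "tenure_months":   [14, 3],
    "monthly_charges": [59.9, 20.0],
    "churned":         [0, 1],
})

REQUIRED_COLUMNS = ["tenure_months", "monthly_charges", "churned"]
dataset = raw[REQUIRED_COLUMNS].copy()  # sensitive fields never enter the pipeline
print(dataset)
```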
3. Access controls:
● Access controls can help to protect data privacy by limiting who has access to sensitive or confidential information.
● Access controls can include password protection, multi-factor authentication, or other security measures to limit access to sensitive data.
● Data scientists should also ensure that access controls are regularly reviewed and updated so that access is only granted to authorized personnel (a minimal role-based check is sketched below).
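
A minimal sketch of a role-based access check in Python. The roles, permissions, and column names are illustrative assumptions; real deployments would rely on the organization's identity and access management tooling rather than hand-rolled checks.

```python
# Columns considered sensitive and the roles allowed to read them
# (all names here are illustrative).
SENSITIVE_COLUMNS = {"medical_history", "salary"}
ROLE_PERMISSIONS = {
    "analyst": {"can_read_sensitive": False},
    "actuary": {"can_read_sensitive": True},
}

def readable_columns(role: str, columns: list[str]) -> list[str]:
    """Return only the columns this role is authorised to read."""
    perms = ROLE_PERMISSIONS.get(role, {"can_read_sensitive": False})
    if perms["can_read_sensitive"]:
        return list(columns)
    return [c for c in columns if c not in SENSITIVE_COLUMNS]

print(readable_columns("analyst", ["age", "salary", "region"]))  # ['age', 'region']
print(readable_columns("actuary", ["age", "salary", "region"]))  # all three columns
```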
4. Secure data storage:
● Data scientists should ensure that personal data is stored securely and encrypted if necessary.
● This can include using secure servers or cloud storage services, implementing firewalls or other security measures, and regularly backing up data to prevent data loss.
● Data scientists should also ensure that data storage policies comply with privacy regulations, such as the GDPR or CCPA (an encryption sketch follows below).
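
A short sketch of encrypting a sensitive record before it is written to disk or uploaded to cloud storage, using the `cryptography` library's Fernet symmetric encryption. The record content is made up, and in practice the key would be held in a key-management service rather than generated next to the data.

```python
from cryptography.fernet import Fernet  # pip install cryptography

# Illustrative only: a real system would fetch the key from a key-management
# service and never store it alongside the encrypted data.
key = Fernet.generate_key()
fernet = Fernet(key)

record = b'{"patient_id": "P-104", "diagnosis": "..."}'
token = fernet.encrypt(record)  # ciphertext that is safe to store

# Only holders of the key can recover the original record.
assert fernet.decrypt(token) == record
```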
5. Data sharing agreements:
● Data sharing agreements can help to protect data privacy when sharing data with third parties.
● Data scientists should ensure that data-sharing agreements include provisions for protecting data privacy, such as requiring third parties to comply with relevant privacy regulations, implementing appropriate security measures, and limiting the use of data to specific purposes.
6. Ethical data science practices:
● Ethical data science practices can include regular review and updating of privacy policies and procedures, ensuring that privacy considerations are integrated into all stages of the data lifecycle, and promoting transparency and accountability in data handling and analysis.
The five C's of data science ethics
The five C's of data science ethics are guiding principles to ensure
responsible and ethical practices in data science.

1. Consent
● What it means: Ensuring that individuals are informed about how
their data is being collected, used, and shared, and that they have
agreed to these practices.
● Key considerations: Clear communication, voluntary participation,
and respecting withdrawal of consent.
● Example: Obtaining informed consent for data collection in a mobile health application.
2. Clarity
● What it means: Data collection and model building should be
transparent. Users should understand how algorithms work, what data is
used, and the potential impacts of decisions made by the models.
● Key considerations: Transparency, simplicity in explanation, and
ensuring the public or stakeholders can understand how data is
processed.
● Example: Publishing an easily understandable explanation of how a
recommendation algorithm works, so users know why they receive certain
recommendations.
3. Consistency
● What it means: Ethical practices must be applied consistently
across all phases of data science projects and across all
individuals affected.
● Key considerations: Ensuring fairness, equal treatment, and
avoiding biases in data collection, model training, and deployment.
● Example: Ensuring that a machine learning model treats all
demographic groups fairly and is not biased against any particular
race or gender.
4. Confidentiality
● What it means: Protecting individuals’ privacy and ensuring that sensitive
or personal data is kept secure and is not disclosed without proper
authorization.
● Key considerations: Data anonymization, encryption, and adhering to
privacy laws like GDPR.
● Example: Using anonymized datasets when developing models and
ensuring data storage systems are secure from unauthorized access.
5. Consequences
● What it means: Understanding the potential social, economic, and
personal impacts of data-driven decisions and ensuring that data
science outcomes do not cause harm.
● Key considerations: Assessing the downstream effects of data
science work, including unintended negative consequences.
● Example: Testing algorithms for any harmful bias, ensuring that
predictive models in fields like criminal justice or hiring do not
perpetuate inequality.
Steps for obtaining informed consent
1. Provide Clear Information:
● Explain what data will be collected (e.g., personal information, browsing habits,
health data).
● Describe how the data will be used, who will have access, and if it will be shared
with third parties.
● Include the purpose of data collection (e.g., research, improving services,
marketing).
2. Explain the Risks and Benefits:
● Outline any potential risks, such as data breaches or unintended consequences.
● Explain how the individual benefits from sharing their data (e.g., personalized
services, contributing to research).
3. Ensure Voluntariness:
● Consent must be freely given without coercion.
● Offer the ability to decline or revoke consent at any time without
facing negative consequences.
4. Use Plain Language:
● Avoid technical jargon so that individuals can easily understand
the terms of consent.
● Include a summary of the terms for clarity, with the option to read
more detailed policies.
5. Allow for Opt-Out:
● Give individuals the choice to opt out of specific data uses (e.g., marketing) or withdraw their consent entirely (see the sketch after this list).
6. Comply with Legal Standards:
● Ensure the consent process complies with laws like
GDPR or CCPA, which require specific, informed, and
voluntary consent.
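
A minimal sketch of how a consent record supporting withdrawal and purpose-specific opt-out (steps 3 and 5 above) might be represented in Python. The class and field names are illustrative assumptions, not a prescribed design.

```python
from dataclasses import dataclass
from datetime import datetime, timezone

@dataclass
class ConsentRecord:
    """Illustrative consent record: specific purposes, dated, revocable."""
    subject_id: str
    purposes: set[str]                    # e.g. {"research", "marketing"}
    granted_at: datetime
    withdrawn_at: datetime | None = None

    def withdraw(self, purpose: str | None = None) -> None:
        """Opt out of one purpose, or withdraw consent entirely."""
        if purpose is None:
            self.purposes.clear()
            self.withdrawn_at = datetime.now(timezone.utc)
        else:
            self.purposes.discard(purpose)

    def allows(self, purpose: str) -> bool:
        return purpose in self.purposes

consent = ConsentRecord("user-42", {"research", "marketing"},
                        granted_at=datetime.now(timezone.utc))
consent.withdraw("marketing")        # opt out of a specific use
print(consent.allows("marketing"))   # False
print(consent.allows("research"))    # True
```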
