
Data Science techniques
What is data science?
Data science is a multidisciplinary field that uses a range of techniques to extract data, draw insights, and solve analytical problems, normally with the end goal of creating business value.
The fields involved in data science include mathematics, statistics, information science, and computer science.
Data science has been growing in importance due to the rise of 'big data'. As data sets grow larger and more unstructured, they become harder to manage and analyse. Data science has therefore become an important field for dealing with these issues.
What are the techniques of data science?
There is a wealth of techniques used by data scientists. Some of these include:
Linear Regression: This is a linear approach that models the relationship between a dependent variable and one or more independent variables as a straight line, in order to predict a target variable.
Clustering: This is where you divide and sort data points into groups so that the points in each group share similar traits. There are two types of clustering: hard clustering, where each data point either belongs to a group or does not, and soft clustering, where each data point is assigned a probability of belonging to each group.
Association analysis: This is where machine learning models analyse data points in a database for patterns and identify 'if-then' associations, also known as 'association rules'. After this analysis, you are able to see the most commonly occurring associations.
Logistic Regression: This type of model, frequently used in statistics, uses a logistic curve, or logistic function, to model a binary dependent variable, which makes it suited to classification problems.
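As a rough illustration of the two regression techniques above, the sketch below fits a straight line by ordinary least squares and then passes a linear score through the logistic function to get a class probability. It uses only the Python standard library. The function names, the sample data, and the weight and bias values are assumptions made up for this example; in particular, the logistic part does not fit a model, it only shows how the logistic curve turns a score into a binary prediction.

```python
import math


def fit_line(xs, ys):
    """Least-squares fit of y ~ slope * x + intercept for one independent variable."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # slope = covariance(x, y) / variance(x)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def logistic(z):
    """Logistic function: maps any real number to a value between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_class(x, weight, bias, threshold=0.5):
    """Turn a linear score into a probability, then into a 0/1 class label."""
    probability = logistic(weight * x + bias)
    return probability, int(probability >= threshold)


# Illustrative data only: the points lie exactly on y = 2x + 1.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
slope, intercept = fit_line(xs, ys)
prediction = slope * 5 + intercept  # predicted target for a new input x = 5

# Hypothetical, hand-picked parameters (not learned from data).
probability, label = predict_class(3.0, weight=1.5, bias=-4.0)

print(f"line: y = {slope:.2f}x + {intercept:.2f}, prediction at x=5: {prediction:.2f}")
print(f"probability of class 1: {probability:.3f}, predicted class: {label}")
```

In practice the logistic weights would be estimated from labelled data, usually with a statistics or machine learning library rather than chosen by hand.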
What are the main phases in data science?
Data science involves many stages in order to reach the end goal. These can include:
Discover: This stage involves the formulation of the initial hypothesis, after the framing of the business problem. It is also necessary to evaluate the resources needed for the project.
Data Preparation: This is where you search for, pre-process, and prepare the data needed for the modeling process. This may involve preparing the analytics sandbox.
Model planning: This stage requires planning of the methods and techniques that are needed in order to draw relevant results.
Model building: Following planning, you collate the methods and techniques so that they form a model, and then put the model into practice by running the data through it.
Present results: After collecting all the results, it is necessary to translate them into a more efficient and concise presentation.
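The later phases above can be sketched as a very small pipeline. This is only a toy example under stated assumptions: the function names (`prepare`, `build_model`, `present`), the sample records, and the choice of a least-squares line as the model are all hypothetical, and the discovery and planning phases are not represented in code.

```python
def prepare(rows):
    """Data preparation: drop records with missing values, split inputs from targets."""
    clean = [(x, y) for x, y in rows if x is not None and y is not None]
    xs = [x for x, _ in clean]
    ys = [y for _, y in clean]
    return xs, ys


def build_model(xs, ys):
    """Model building: fit a straight line by least squares and return a predictor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


def present(model, xs, ys):
    """Present results: condense the model's errors into a short summary."""
    errors = [abs(model(x) - y) for x, y in zip(xs, ys)]
    return {"records": len(xs), "mean_abs_error": sum(errors) / len(errors)}


# Made-up raw records; None marks a missing value.
raw = [(1, 2.0), (2, None), (3, 6.0), (None, 1.0), (4, 8.0)]

xs, ys = prepare(raw)            # data preparation
model = build_model(xs, ys)      # model building
forecast = model(5)              # running new data through the model
summary = present(model, xs, ys) # presenting results

print(summary, f"forecast for x=5: {forecast:.2f}")
```

A real project would normally evaluate the model on data that was held back from fitting, rather than on the same records used to build it.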
To find techniques and templates for data science, please refer to the tools on Eloquens below.
Most popular techniques
Data Science in Audit Investment Valuation Testing PSX
Testing year-end Investment Valuation of PSX listed companies.
Decision Tree Algorithm & Analysis
Edureka gives a comprehensive tutorial on decision tree analysis with the help of examples. (Free, by Edureka)
How To Correctly Validate Machine Learning Models
Whitepaper discussing the 4 main components for correctly validating machine learning models. (Free, by RapidMiner)
Machine Learning Algorithms Tutorial
Teaching the basics of machine learning, along with the ways in which you can use machine learning for problem solving. (Free, by Edureka)
The Top 5 Algorithms used in Data Science
This video discusses the 5 most widely used algorithms in Data Science and how to use them. (Free, by Edureka)
Building Robust Machine Learning Models
This presentation focuses on the fundamentals of building robust machine learning models. (Free, by Data Science Dojo)
Measuring Model Performance
Video tutorial on how to measure your model's performance. (Free, by Data Camp)
Choosing the Right Machine Learning Algorithm
Seth Mottaghinejad discusses the things we should be thinking about when choosing a machine learning algorithm. (Free, by Conference Videos)
How to Learn Machine Learning in 6 Months
Senior Data Scientist Zach Millar explains how you can learn machine learning in 6 months through a roadmap process. (Free, by IDEAS)
Linear Regression Algorithm Tutorial
Edureka explains the basics of linear regression with the use of examples and use cases. (Free, by Edureka)
OFFSET MATCH and Data Validation Excel Model Template
Quick and easy to use 2-tab Excel template for OFFSET MATCH and data validation. (Free, by Wall Street Prep)
How to use and implement the Interpolate-Lookup function
This is a detailed guide on how to use and implement the Interpolate-Lookup function. (Free, by Prof. Ed Bodmer)
How to Include Dummy Variables into a Regression
Learn how to include Dummy Variables into a Regression. (Free, by 365 Data Science)
How to Apply INDEX and MATCH Separately and Combined | Advanced Excel
Learn how to apply both functions, INDEX and MATCH, separately and combined on Excel. (Free, by 365 Data Science)
How to Classify Data | Types of Data
Read our article to find out the two main ways of classifying data. (Free, by 365 Data Science)
How to Apply and Combine INDIRECT Excel Function with VLOOKUP
Learn how to apply and combine INDIRECT Excel Function with VLOOKUP. (Free, by 365 Data Science)
Newly published
Data Science in Audit - Payroll Audit
Work paper to test the reasonableness of bonus payments compared to salary and any indication of fraud.
Data analysis & dynamic reporting for cloud accounting Xero and QBO using Excel
Using Excel to automate processes in Xero and QBO? Yes you can. Using DataDear you are able to both pull and push data. (Free, by Lance Rubin)
Python for Audit Testing (Valuation)
Data Science in External Auditing.
Data Science in Audit Investment Valuation Testing PSX
Testing year-end Investment Valuation of PSX listed companies.
How to use and implement the Interpolate-Lookup function
This is a detailed guide on how to use and implement the Interpolate-Lookup function. (Free, by Prof. Ed Bodmer)
Data Science for Audit - Dividend Income Testing
Data Science for Audit, Testing Dividend Income Using Python.
How to Define Relational Database Essentials
Learn about the two main types of databases. (Free, by 365 Data Science)
How to Differentiate Database and Spreadsheet
In this post, we will focus on the differences between database vs spreadsheet. (Free, by 365 Data Science)
How to Create a Database
Learn more about basic database terminology before you start coding. (Free, by 365 Data Science)
How to Add a Second “if” Statement | ELIF
Learn an elegant way of adding a second “if” statement to one of our expressions. (Free, by 365 Data Science)
How to Define Python Tuples
Python tuples are another type of data sequence, but unlike lists, they are immutable... (Free, by 365 Data Science)
How to Use Conditionals and Loops in Python
Let’s see how to combine conditionals and loops in Python. (Free, by 365 Data Science)
How to Measure Asymmetry with Skewness
The most commonly used tool to measure asymmetry is skewness. Learn more about it by checking out this article. (Free, by 365 Data Science)
How to Handle large data tables with ease | VLOOKUP COLUMN and ROW
Learn how to handle large data tables with ease! (Free, by 365 Data Science)
How to Use the Simple Linear Regression Model | Geometrical Representation
Find out how to use the simple linear regression model through geometrical representation. (Free, by 365 Data Science)
How to Use VLOOKUP and MATCH in Excel
We’ve seen several function combinations so far. In this lesson, we’ll present another one that can be useful. (Free, by 365 Data Science)
Full catalog
Data analysis & dynamic reporting for cloud accounting Xero and QBO using Excel
Using Excel to automate processes in Xero and QBO? Yes you can. Using DataDear you are able to both pull and push data. (Free, by Lance Rubin)
Data Science in Audit Investment Valuation Testing PSX
Testing year-end Investment Valuation of PSX listed companies.
How to Apply The Central Limit Theorem
Learn how to apply the Central Limit Theorem in Statistics. (Free, by 365 Data Science)
How to Use Student's T Distribution
Learn everything you need to know about Student's T Distribution. (Free, by 365 Data Science)
How to use and implement the Interpolate-Lookup function
This is a detailed guide on how to use and implement the Interpolate-Lookup function. (Free, by Prof. Ed Bodmer)
Python for Audit Testing (Valuation)
Data Science in External Auditing.
OFFSET MATCH and Data Validation Excel Model Template
Quick and easy to use 2-tab Excel template for OFFSET MATCH and data validation. (Free, by Wall Street Prep)
Data Science in Audit - Payroll Audit
Work paper to test the reasonableness of bonus payments compared to salary and any indication of fraud.
How to Learn Machine Learning in 6 Months
Senior Data Scientist Zach Millar explains how you can learn machine learning in 6 months through a roadmap process. (Free, by IDEAS)
Decision Tree Algorithm & Analysis
Edureka gives a comprehensive tutorial on decision tree analysis with the help of examples. (Free, by Edureka)
How To Correctly Validate Machine Learning Models
Whitepaper discussing the 4 main components for correctly validating machine learning models. (Free, by RapidMiner)
Machine Learning Algorithms Tutorial
Teaching the basics of machine learning, along with the ways in which you can use machine learning for problem solving. (Free, by Edureka)
The Top 5 Algorithms used in Data Science
This video discusses the 5 most widely used algorithms in Data Science and how to use them. (Free, by Edureka)
Building Robust Machine Learning Models
This presentation focuses on the fundamentals of building robust machine learning models. (Free, by Data Science Dojo)
Measuring Model Performance
Video tutorial on how to measure your model's performance. (Free, by Data Camp)
Choosing the Right Machine Learning Algorithm
Seth Mottaghinejad discusses the things we should be thinking about when choosing a machine learning algorithm. (Free, by Conference Videos)