
Data Science techniques 1 comment

What is data science?
Data science is a multidisciplinary field that uses a range of techniques in order to extract data, draw insights, and solve analytical problems. With the end goal normally being to create business value.
The fields that are involved in data science vary from mathematics, statistics, information science, and computer science.
Data science has been growing in importance due to the rise of 'big data'. The increase in the size of data, and its more unstructured form, means that it is less manageable to analyse. Data science has therefore become an important field in which to deal with these issues.
What are the techniques of data science?
There are a wealth of techniques used by data scientists, some of these include:
Linear Regression: This is the linear approach, i.e. a graphical representation on a straight line, which models the relationship of a dependent variable and independent variable in order to predict a target variable.
Clustering: This is where you divide and sort data points into specific groups so that the data points share similar traits. There are two types of clustering, hard clustering - where data points either fit into a group or they don't -, and soft clustering - this is where the probability of a particular data point being in the category is made.
Association analysis: This is where machine learning models analyse data points in a database for patterns, and consequently identifies 'if-then' associations, also known as 'association rules'. After this analysis, you are able to see the commonly occurring associations. Follow this link to find a more detailed definition of association analysis.
Logistic Regression: This type of model, frequently used in statistics, uses a logistic curve, or logistic function, for modelling a binary dependent variable, overcoming the classification problem. Read this useful article on logistic regression to learn more.
What are the main phases in data science?
Data Science involves many stages in order to reach the end goal. These can include:
Discover: This stage involves the formulation of the initial hypothesis, after the framing of the business problem. It is also necessary to evaluate the resources needed for the project.
Data Preparation: This is where search for, pre-process, and ready the data needed for the modeling process. This may involve preparing the analytics sandbox.
Model planning: This stage requires planning of the methods and techniques that are needed in order to draw relevant results.
Model building: Following planning, you collate the methods and techniques so that they form a model
Put in to the model practice running the data.
Present results: After collecting all the results, it is necessary to translate them into a more efficient and concise presentation.
To find out more about data science and the techniques necessary, please refer to these webpages:
To find techniques and templates for data science, please refer to the tools on Eloquens below.
Most popular techniques
- Testing year-end Investment Valuation of PSX listed companies139Discussadd_shopping_cart
Decision Tree Algorithm & Analysis
Edureka gives a comprehensive tutorial on decision tree analysis with the help of examples.203Discussfreeby Edureka
How To Correctly Validate Machine Learning Models
Whitepaper discussing the 4 main components for correctly validating machine learning models.139Discussadd_shopping_cartfreeby RapidMiner
Machine Learning Algorithms Tutorial
Teaching the basics of machine learning, along with the ways in which you can use machine learning for problem solving.102Discussfreeby Edureka
The Top 5 Algorithms used in Data Science
This video discusses the 5 most widely used algorithms in Data Science and how to use them.185Discussadd_shopping_cartfreeby Edureka
Building Robust Machine Learning Models
This presentation focuses on the fundamentals of building robust machine learning models.96Discussfreeby Data Science Dojo
Measuring Model Performance
Video tutorial on how to measure your model's performance.71Discussfreeby Data Camp
Choosing the Right Machine Learning Algorithm
Seth Mottaghinejad discusses the things we should be thinking about when choosing a machine learning algorithm.125Discussfreeby Conference Videos
How to Learn Machine Learning in 6 Months
Senior Data Scientist Zach Millar explains how you can learn machine learning in 6 months through a roadmap process.150Discussfreeby IDEAS
Linear Regression Algorithm Tutorial
Edureka explains the basics of linear regression with the use of examples and use cases.128Discussfreeby Edureka
OFFSET MATCH and Data Validation Excel Model Template
Quick and easy to use 2-tab Excel template for OFFSET MATCH and data validation.303Discussadd_shopping_cartfreeby Wall Street Prep
How to use and implement the Interpolate-Lookup function
This is a detailed guide on how to use and implement the Interpolate-Lookup function.555Discussadd_shopping_cartfreeby Prof. Ed Bodmer
How to Include Dummy Variables into a Regression
Learn how to include Dummy Variables into a Regression.921add_shopping_cartfreeby 365 Data Science
How to Apply INDEX and MATCH Separately and Combined | Advanced Excel
Learn how to apply both functions, INDEX and MATCH, separately and combined on Excel.691add_shopping_cartfreeby 365 Data Science
How to Classify Data | Types of Data
Read our article to find out the two main ways of classifying data.711add_shopping_cartfreeby 365 Data Science
How to Apply and Combine INDIRECT Excel Function with VLOOKUP
Learn how to apply and combine INDIRECT Excel Function with VLOOKUP.581add_shopping_cartfreeby 365 Data Science
Newly published
Data Science in Audit - Payroll Audit
Work paper to test reasonablness of Bonus payment compare to Salary and any indication of fraud.286Discussadd_shopping_cartData analysis & dynamic reporting for cloud accounting Xero and QBO using Excel
USING EXCEL TO AUTOMATE PROCESS IN XERO AND QBO? YES YOU CAN Using DataDear you are able to both pull and push data.500Discussadd_shopping_cartfreeby Lance Rubin
Python for Audit Testing (Valuation)
Data Science in External Auditing1,213Discussadd_shopping_cartData Science in Audit Investment Valuation Testing PSX
Testing year-end Investment Valuation of PSX listed companies139Discussadd_shopping_cartHow to use and implement the Interpolate-Lookup function
This is a detailed guide on how to use and implement the Interpolate-Lookup function.555Discussadd_shopping_cartfreeby Prof. Ed Bodmer
Data Science for Audit- Dividend Income Testing
Data Science for Audit, Testing Dividend Income Using Python105Discussadd_shopping_cartHow to Define Relational Database Essentials
Learn about the two main types of databases.601add_shopping_cartfreeby 365 Data Science
How to Differentiate Database and Spreadsheet
In this post, we will focus on the differences between database vs spreadsheet.661add_shopping_cartfreeby 365 Data Science
How to Create a Database
Learn more about basic database terminology before you start coding.681add_shopping_cartfreeby 365 Data Science
How to Add a Second “if” Statement | ELIF
Learn an elegant way of adding a second “if” statement to one of our expressions.421add_shopping_cartfreeby 365 Data Science
How to Define Python Tuples
Python tuples are another type of data sequences, but differently to lists, they are immutable...481add_shopping_cartfreeby 365 Data Science
How to Use Conditionals and Loops in Python
Let’s see how to combine conditionals and loops in Python.421add_shopping_cartfreeby 365 Data Science
How to Measure Asymmetry with Skewness
The most commonly used tool to measure asymmetry is skewness. Learn more about it by checking out this article.752add_shopping_cartfreeby 365 Data Science
How to Handle large data tables with ease | VLOOKUP COLUMN and ROW
Learn how to handle large data tables with ease!761add_shopping_cartfreeby 365 Data Science
How to Use the Simple Linear Regression Model | Geometrical Representation
Find out how to use the simple linear regression model through geometrical representation.491add_shopping_cartfreeby 365 Data Science
How to Use VLOOKUP and MATCH in Excel
We’ve seen several function combinations so far. In this lesson, we’ll present another one that can be useful.891add_shopping_cartfreeby 365 Data Science
Full catalog
Data Science in Audit Investment Valuation Testing PSX
Testing year-end Investment Valuation of PSX listed companies139Discussadd_shopping_cartHow to Apply The Central Limit Theorem
Learn how to apply the Central Limit Theorem in Statistics.671add_shopping_cartfreeby 365 Data Science
How to Use Student's T Distribution
Learn everything you need to know about Student's T Distribution.531add_shopping_cartfreeby 365 Data Science
How to use and implement the Interpolate-Lookup function
This is a detailed guide on how to use and implement the Interpolate-Lookup function.555Discussadd_shopping_cartfreeby Prof. Ed Bodmer
Data analysis & dynamic reporting for cloud accounting Xero and QBO using Excel
USING EXCEL TO AUTOMATE PROCESS IN XERO AND QBO? YES YOU CAN Using DataDear you are able to both pull and push data.500Discussadd_shopping_cartfreeby Lance Rubin
Python for Audit Testing (Valuation)
Data Science in External Auditing1,213Discussadd_shopping_cartOFFSET MATCH and Data Validation Excel Model Template
Quick and easy to use 2-tab Excel template for OFFSET MATCH and data validation.303Discussadd_shopping_cartfreeby Wall Street Prep
Data Science in Audit - Payroll Audit
Work paper to test reasonablness of Bonus payment compare to Salary and any indication of fraud.286Discussadd_shopping_cartHow to Learn Machine Learning in 6 Months
Senior Data Scientist Zach Millar explains how you can learn machine learning in 6 months through a roadmap process.150Discussfreeby IDEAS
Decision Tree Algorithm & Analysis
Edureka gives a comprehensive tutorial on decision tree analysis with the help of examples.203Discussfreeby Edureka
How To Correctly Validate Machine Learning Models
Whitepaper discussing the 4 main components for correctly validating machine learning models.139Discussadd_shopping_cartfreeby RapidMiner
Machine Learning Algorithms Tutorial
Teaching the basics of machine learning, along with the ways in which you can use machine learning for problem solving.102Discussfreeby Edureka
The Top 5 Algorithms used in Data Science
This video discusses the 5 most widely used algorithms in Data Science and how to use them.185Discussadd_shopping_cartfreeby Edureka
Building Robust Machine Learning Models
This presentation focuses on the fundamentals of building robust machine learning models.96Discussfreeby Data Science Dojo
Measuring Model Performance
Video tutorial on how to measure your model's performance.71Discussfreeby Data Camp
Choosing the Right Machine Learning Algorithm
Seth Mottaghinejad discusses the things we should be thinking about when choosing a machine learning algorithm.125Discussfreeby Conference Videos
- Have a Data Mining Technique to Share
Your Data Science Technique
Publish your technique
Learn more about digital publishing
Eloquens Member