Popular Posts
Setting up a Python Environment for Machine Learning and Data Science with Conda Virtual Environment and Jupyter Notebook in MacOS and Windows
data_science_and_data_miningYou are just getting started with Data Science, Machine Learning or Artificial Intelligence, and Python is one of the languages you have chosen. Right now you are working on setting up a Python…
Fully Remove Python and Install a Fresh Python in MacOS and Windows
data_science_and_data_miningThis post is the one for you if you are ready to start your programming journey with Python, and want to set up a Python environment in your Mac or PC. Python has become so popular that it has made…
A Beginner Guide to String Pattern Matching in R by Regular Expression Part 2 Examples
data_science_and_data_miningSeveral code examples of using regular expressions with R for string processing. This is one part of A Beginner Guide to String Pattern Matching in R by Regular Expression.
A Beginner Guide to String Pattern Matching in R by Regular Expression Part 1
data_science_and_data_miningFormal textual content is a mixture of words and punctuations while online conversational text comes with symbols, emoticons and misspellings. Before performing analysis or building a learning model,…
Building and Evaluating a Predictive Model w/ Linear Regression in RapidMiner Studio
data_science_and_data_miningLinear regression is a simple while practical model for making predictions in many fields. In order to apply linear regression to a dataset and evaluate how well the model will perform, we can build a…
What is a Database Driver? Access Database in VS Code. R. Python. Java
databaseIn a computer system, an adaptor program is required for making a connection to another system of different type. Similar to connecting a printer to a computer by using a printer driver, a DBMS…
Data Binning and Plotting in R
data_science_and_data_miningUpdated on 9/28/2019
Outlier Detection by Data Visualization with Boxplot
data_science_and_data_miningOutliers in a collection of data are the values which are far away from most other points. A boxplot is usually used to visualize a dataset for spotting unusual data points. However, is an outlier…