Published in Dataman in AI · Pinned · Anomaly Detection with PyOD! · (Revised on October 12, 2021) If you do anomaly detection, I recommend you take a look at this great tool called the Python Outlier Detection (PyOD). … · Anomaly Detection · 8 min read
Published in Dataman in AI · Pinned · Explain Your Model with the SHAP Values · Better Interpretability Leads to Better Adoption · Is your highly-trained model easy to understand? A sophisticated machine learning algorithm usually can produce accurate predictions, but its notorious “black box” nature does not help adoption at all. Think about this: If you ask me to swallow a black pill without telling me… · Machine Learning · 13 min read
Published in Dataman in AI · Pinned · A Primer to Image Pre-trained Models · Over the past thirty years, many researchers have produced a wealth of knowledge in image recognition algorithms and image data. If you are interested in image training but do not know where to start, I hope this post is a good starting point for you. This post outlines the past evolution and points… · Data Science · 9 min read
Published in Dataman in AI · Pinned · The SHAP Values with H2O Models · Many machine learning algorithms are complicated and not easy to understand, even though they have achieved impressive levels of accuracy. As humans, we must be able to fully understand how decisions are being made so that we can trust the decisions of AI systems. We need ML models to function… · Data Science · 9 min read
Published in Dataman in AI · Pinned · Top Data Science Interview Questions and Answers · You receive a data science interview opportunity from your dream company. You have surveyed many "top 50 questions" articles but still feel uncertain. Since there are already many similar articles, why do I dare to add an article to this crowded topic? In this article I re-write many ordinary answers… · Data Science · 18 min read
Published in Dataman in AI · Updated 5 days ago · How Is the Partial Dependence Plot Computed? · In a linear regression, the relationship between the target and a feature can be easily observed by the sign of the coefficient. How can we discover the relationships between the target and variables in a machine learning model? The solution is the Partial Dependence Plot (PDP). It shows the marginal… · Data Science · 4 min read
Mar 29 · My Study Guide for AWS Certified Cloud Practitioner Exam — Cheat Sheet · Although the series “My Study Guide for AWS Certified Cloud Practitioner Exam” provides an extensive survey of questions and answers, it is ideal to have a “cheat sheet” for quick reference. For each topic, I restrict myself to fewer than 10 keywords that you absolutely have to know. If… · AWS · 8 min read
Published in Dataman in AI · Updated Mar 29 · Build Your Voice Change App with AWS S3, Transcribe, Polly, and DynamoDB · Why Do I Build this Voice Change Project? · I built this tutorial project because it involves two machine learning models, the serverless framework, IAM, and DynamoDB in AWS. By following the screenshots in this article step by step, you will be able to build the entire pipeline. You… · AWS · 12 min read
Published in Dataman in AI · Feb 8 · The AWS Boto3 Tutorial to Help You to Start · If you have a free-tier AWS account and currently use the point-and-click interface of the AWS management console, you may want to take your skills to the next level by using Boto3. It lets you use AWS services from your Python code so you can streamline the entire process programmatically… · Data Science · 7 min read
Published in Dataman in AI · Updated Feb 3 · A Wide Variety of Models for Multi-class Classification · Many real-life examples involve more than two choices. Rather than Hamlet's “to be” or “not to be”, the options may be several, such as “Yes”, “No”, “I don’t know”, and “I don’t want to choose”. Since we use data science to help our lives, we often need to predict an… · Data Science · 16 min read