
ML Foundations Course by Great Learning - Notes

In this blog post, I share the notes and assignments from my ML Foundations course by Great Learning.

ML Maths Basics

Topics Covered
  • Line Concept
  • Lines, Planes and Hyperplanes
  • Vector Algebra: Magnitude and Dimension
  • Vector Algebra: Vector Operations
  • Dot Product
  • Matrix Algebra
  • Functions
  • Maxima and Minima of Functions
  • Chain Rule
  • Maxima and Minima Applications in ML
  • Gradient Descent using Partial Derivatives












Intro to AI and ML

AI: a computer program that does something smart or makes smart decisions.

When a computer program learns about the world from data, we call it ML.
We assume that the past is a good representation of the future.

Model building from data
  • take data as input
  • find patterns in data
  • summarise the pattern in a mathematically precise way
Machine Learning automates this model building.
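
The three steps above can be sketched in a few lines of Python. This is my own minimal illustration, not code from the course: the data is made up and noise-free, the "pattern" is assumed to be a straight line, and `fit_line` is a hypothetical helper name.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept (hypothetical helper)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Step 1: take data as input (made-up values that follow y = 2x + 1 exactly).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]

# Steps 2 and 3: find the pattern and summarise it as two numbers.
slope, intercept = fit_line(xs, ys)
print(f"y = {slope:.2f} * x + {intercept:.2f}")  # prints: y = 2.00 * x + 1.00
```

The whole dataset is summarised by just `slope` and `intercept`, which is what "mathematically precise" means here.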
If the data had no noise, finding a pattern would be easy, but unfortunately real data contains both information and noise.
Noise is unstructured and random. It does not repeat itself.
ML does not assume that the data came from a specific model, whereas statistics does.
ML tries many candidate models to separate information from noise and checks which one does better.
The most complicated model turns out to be the one that fits both the information and the noise. This is called overfitting.
A very simple model leaves out information that is available to us. This is called underfitting.
The goal is to find a balance: a model that captures as much information as it can and leaves out the noise. To get there, ML folks try a sequence of models.
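
To make underfitting and overfitting concrete, here is a small sketch of my own (not from the course). It assumes the information is the line y = 2x and uses hand-picked offsets as a stand-in for random noise. The three models are my choices for illustration: a constant (too simple), a least-squares line (balanced), and a "memoriser" that answers with the nearest training point (too complex). All names are hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def mse(predict, xs, ys):
    """Mean squared error of a prediction function on (xs, ys)."""
    return sum((predict(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Information: y = 2x. Noise: fixed made-up offsets standing in for randomness.
train_x = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]
train_noise = [0.9, -1.1, 0.4, -0.6, 1.2, -0.8, 0.3, -0.3]
train_y = [2.0 * x + e for x, e in zip(train_x, train_noise)]

# New points the models have not seen, with different noise.
test_x = [x + 0.25 for x in train_x]
test_noise = [-0.5, 0.7, -0.2, 0.6, -0.9, 0.4, 0.1, -0.2]
test_y = [2.0 * x + e for x, e in zip(test_x, test_noise)]

mean_y = sum(train_y) / len(train_y)
slope, intercept = fit_line(train_x, train_y)

def too_simple(x):
    # Ignores x entirely, so it leaves out the information (underfitting).
    return mean_y

def balanced(x):
    # Captures the trend without chasing individual noisy points.
    return slope * x + intercept

def too_complex(x):
    # Memorises every training point, noise included (overfitting).
    nearest = min(range(len(train_x)), key=lambda i: abs(train_x[i] - x))
    return train_y[nearest]

models = {"too simple": too_simple, "balanced": balanced, "too complex": too_complex}
errors = {
    name: (mse(model, train_x, train_y), mse(model, test_x, test_y))
    for name, model in models.items()
}
for name, (train_err, test_err) in errors.items():
    print(f"{name:12s} train MSE = {train_err:6.3f}   test MSE = {test_err:6.3f}")
```

On this toy data the memoriser has zero error on the data it was built from but does worse than the line on the new points, while the constant model is poor on both. Comparing models on data they have not seen is one way to try a sequence of models and pick the balanced one.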





