PinnedWei YiinTowards Data ScienceHow Does an Image-Text Multimodal Foundation Model WorkLearn how an image-text multi-modality model can perform image classification, image retrieval, and image captioning·19 min read·Jun 1, 2024--3--3

PinnedWei YiinTowards Data ScienceHow Does the Segment-Anything Model’s (SAM’s) Encoder Work?a deep dive into how image content embedding, sine and cosine positional embedding, guidance click embedding and dense mask embedding is…·16 min read·May 14, 2024--2--2

PinnedWei YiinTowards Data ScienceHow does the Segment-Anything Model’s (SAM’s) decoder work?A deep dive into how the Segment-Anything model’s decoding procedure, with a focus on how its self-attention and cross-attention mechanism…·18 min read·Mar 24, 2024--1--1

PinnedWei YiinTowards Data ScienceSpeeding up vision transformer prediction by 9 times faster with PyTorch, ONNX and TensorRTHow to use 16bit float, TensorRT, network rewriting and multi-threading to dramatically speed up deep learning model prediction·11 min read·Jun 4, 2023----

Wei YiinTowards Data ScienceHow Decision Trees Split Nodes, from Loss Function PerspectiveLearn how a decision tree splits nodes only to minimize its loss function·12 min read·May 15, 2023--1--1

Wei YiinTowards Data ScienceDistributed data parallel and distributed model parallel in PyTorchHow distributed data parallel DDP and distributed model parallel DMP works in stochastic gradient descent with large models and huge data·14 min read·May 8, 2023--1--1

Wei YiinTowards Data ScienceUnderstanding the Denoising Diffusion Probabilistic Model, the Socratic WayA deep dive into the motivation behind the denoising diffusion model and detailed derivations for the loss function·69 min read·Feb 25, 2023--5--5

Wei YiinTowards Data ScienceThe Input-output Attention Mechanism from “Neural Machine Translation by Jointly Learning…Learn the math and intuition behind the input-output attention mechanism in a RNN-based language to language translation model·11 min read·Mar 18, 2022----

Wei YiinTowards Data ScienceCan We Use Stochastic Gradient Descent (SGD) on a Linear Regression Model?Learn why it is valid to use SGD on a linear regression model for parameter learning, see however, SGD can be inefficient, and appreciate…·17 min read·Aug 5, 2021----

Wei YiinTowards Data ScienceWhere do confidence interval in linear regression come from — the case of least squares formulationThis article explains in least square linear regression model, how to understand parameter std err, t, P>|t| and confidence intervals.·21 min read·Jun 28, 2021----