PinnedPeggy ChanginTowards Data ScienceLucene Inside Out — Dealing With Integer Encoding and CompressionDelve into PackedInts, VInt, FixedBitSet, and RoaringDocIdSet (Roaring Bitmaps)·13 min read·Jun 28, 2023----
PinnedPeggy ChanginTowards Data ScienceIVFPQ + HNSW for Billion-scale Similarity SearchThe best indexing approach for billion-sized vector datasets·17 min read·Aug 29, 2022--4--4
PinnedPeggy ChanginTowards Data ScienceSimilarity Search with IVFPQFind out how the inverted file index (IVF) is implemented alongside product quantization (PQ) for a fast and efficient approximate nearest…·9 min read·May 25, 2022--4--4
PinnedPeggy ChanginTowards Data ScienceProduct Quantization for Similarity SearchHow to compress and fit a humongous set of vectors in memory for similarity search with asymmetric distance computation (ADC)·8 min read·May 9, 2022--4--4
PinnedPeggy ChanginTowards Data ScienceAdvanced Techniques for Fine-tuning TransformersLearn these advanced techniques and see how they can help improve results·11 min read·Sep 17, 2021--4--4
Peggy ChangMastering Dynamic Programming IIManual tabulation and workout is a great way to start grokking, analyzing, and spotting patterns, as well as strengthening our…·10 min read·Apr 18, 2022--1--1
Peggy ChanginTowards Data ScienceMastering Dynamic ProgrammingUnderstanding the fundamentals and knowing when and how to apply this optimization technique·16 min read·Feb 28, 2022--2--2
Peggy ChanginTowards AIBuilding a Product Recommendation Engine with AWS SageMakerLearn how to build and train a personalized recommender engine with Amazon SageMaker Factorization Machines·13 min read·Nov 24, 2021--1--1
Peggy ChanginTowards Data ScienceAWS Certified Machine Learning — SpecialtyTips and suggestions on how to prepare and pass the exam·9 min read·Oct 7, 2021--2--2
Peggy ChanginTowards Data ScienceTransformers, can you rate the complexity of reading passages?Fine-tuning RoBERTa with PyTorch to predict reading ease of text excerpts·14 min read·Aug 18, 2021--6--6