Nandini Lokesh Reddy: "RT-DETR: The Next Evolution in Real-Time Object Detection". Real-time object detection is a vital field with a wide array of applications, from object tracking to autonomous driving. Imagine a… (Jul 19, 2024)
Cristian Leo, in TDS Archive: "The Math Behind Multi-Head Attention in Transformers". Deep Dive into Multi-Head Attention, the secret element in Transformers and LLMs. Let's explore its math, and build it from scratch. (Jul 16, 2024)
Daniel Warfield, in TDS Archive: "YOLO — By Hand". A breakdown of the math within YOLO. (Jun 7, 2024)
Ethan Yanjia Li, in TDS Archive: "Dive Really Deep into YOLO v3: A Beginner's Guide". A detailed review and hands-on implementation of YOLO v3 in TensorFlow 2 for beginners. (Dec 30, 2019)
Sanna Persson: "YOLOv3 — Implementation with Training setup from Scratch". For such a popular paper there are still few implementations explained of the YOLOv3 architecture completely from scratch. I'll do my best… (Mar 21, 2021)
Kaushik Koneripalli: "Satellite Image Data Augmentation using Stable Diffusion for Object detection & segmentation." (Sep 2, 2023)
Sik-Ho Tsang: "Brief Review — TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object…". TPH-YOLOv5, Detects Small & Dense Objects in Drone Images. (Jun 18, 2023)
Lakera: "How robust are pre-trained object detection ML models like YOLO or DETR?" We tested the robustness of state-of-the-art computer vision models to assess their generalization ability. Here is what we found. (Nov 2, 2022)
Mostafa Ibrahim, in TDS Archive: "WBF: Optimizing object detection — Fusing & Filtering predicted boxes". Weighted boxes fusion has become the new SOTA to optimize object detection models. (Mar 17, 2021)
Hugegene: "A Single Camera 3D Functions". Intrinsic Matrix, Extrinsic Matrix, Homography, Inverse Perspective Mapping. (Dec 5, 2021)
Amal: "Improve your Object Detection and Instance Segmentation Results for Detection of Small Objects". One of the important tasks of computer vision is to identify objects in real-time, what if we have large image of large size and the… (Feb 17, 2022)
Vaibhav Bagri: "Looking at Research Work in Real Time Object Detection". Object detection refers to the ability of computers to accurately detect the presence of particular objects in an image and accurately draw… (Mar 31, 2022)
Johannes Rieke, in TDS Archive: "Object detection with neural networks". A simple tutorial using keras. (Jun 12, 2017)
Chris Ha: "Why this one (literally) small model spells big things for Vision Transformers." MobileVit. (Oct 12, 2021)
Miguel Pinto, in TDS Archive: "Learn AI Today 05: Image segmentation with U-Net models". A simple introduction to image segmentation for practical Deep Learning. (Jan 15, 2021)
Pallawi: "Semantic segmentation with U-Net- train, and test on your custom data in Keras". What is semantic segmentation? (Jun 3, 2019)
Sukriti Paul, in Coinmonks: "Learn How to Train U-Net On Your Dataset". With the aim of performing semantic segmentation on a small bio-medical data-set, I made a resolute attempt at demystifying the workings of… (Jun 8, 2018)
Konstantin Kutzkov, in TDS Archive: "Bilinear pooling for fine-grained visual recognition and multi-modal deep learning". Advanced neural network architectures work by learning feature interactions. (Oct 7, 2021)
Cameron R. Wolfe, Ph.D., in TDS Archive: "Deep Learning on Video (Part Two): The Rise of Two-Stream Architectures". How two-stream network architectures revolutionized deep learning on video. (Jan 28, 2022)
Allohvk, in TDS Archive: "Swin/Vision Transformers — Hacking the human eye". One ML architecture to rule them all? Perhaps not… (Jan 17, 2022)