Reproduce GPT2 (124M) by Andrej Karpathy 2 Weights and Bias Initials Normalization, BatchNorm and BackProp in Makemore

Diving deeper into the Makemore code to illustrate subtle details that affect the network's output. One example is the concept of "dead neurons": if a squashing function such as tanh pushes too many inputs to its extremes of -1 and +1, the gradients flowing back to the preceding neurons are killed. By resetting the scale of the initialized weights and biases, the …
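
To make the saturation point concrete, here is a minimal sketch in plain Python. It is not the Makemore code: the function names, the layer width of 30 inputs, the 0.99 threshold, and the two weight scales are all assumptions chosen for illustration. It estimates how often a tanh unit lands in the flat region for a large and a small weight scale, and shows that the local gradient of tanh, 1 - h^2, vanishes there.

```python
import math
import random


def saturated_fraction(weight_scale, n_inputs=30, n_samples=2000,
                       threshold=0.99, seed=0):
    """Estimate how often a tanh unit saturates (|h| > threshold).

    Each sample draws a fresh unit-Gaussian input vector and fresh Gaussian
    weights with standard deviation `weight_scale` (hypothetical setup).
    """
    rng = random.Random(seed)
    saturated = 0
    for _ in range(n_samples):
        # Pre-activation: weighted sum of the inputs.
        pre_activation = sum(
            rng.gauss(0.0, 1.0) * rng.gauss(0.0, weight_scale)
            for _ in range(n_inputs)
        )
        h = math.tanh(pre_activation)
        if abs(h) > threshold:
            saturated += 1
    return saturated / n_samples


def tanh_local_grad(h):
    """Derivative of tanh expressed through its output h: 1 - h^2."""
    return 1.0 - h * h


large = saturated_fraction(weight_scale=1.0)
small = saturated_fraction(weight_scale=0.1)
print(f"saturated fraction, weight scale 1.0: {large:.3f}")
print(f"saturated fraction, weight scale 0.1: {small:.3f}")
print(f"local gradient at h = 0.99: {tanh_local_grad(0.99):.4f}")
```

With the larger scale most samples sit near -1 or +1, where the local gradient is close to zero, so almost nothing propagates backward through the unit. Shrinking the weight scale keeps the pre-activations in the region where tanh still has a useful slope.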

Reproduce GPT2 (124M) by Andrej Karpathy 3 Tokenization

Tokenization is the process of breaking down text into smaller units (tokens) such as words, subwords, or characters. Different tokenization methods are used depending on the task, language, and requirements of the model: word-based tokenization; character-based tokenization, which increases computation cost considerably because sequences become much longer; and subword-based tokenization such as Byte Pair Encoding (BPE), BERT's WordPiece, and SentencePiece. Sentence …
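
The trade-off between these methods can be sketched with a toy example. The snippet below is an illustration only, not the GPT-2 tokenizer: the sample text and the helper names `most_frequent_pair` and `merge_pair` are made up here, and real BPE operates on bytes with a learned merge table. It compares word-level and character-level sequence lengths, then applies a single BPE-style merge of the most frequent adjacent pair.

```python
from collections import Counter

text = "low lower lowest"

# Word-based: few tokens, but every new word needs its own vocabulary entry.
word_tokens = text.split()

# Character-based: tiny vocabulary, but much longer sequences.
char_tokens = list(text)


def most_frequent_pair(tokens):
    """Return the most frequent adjacent pair of tokens."""
    pairs = Counter(zip(tokens, tokens[1:]))
    return pairs.most_common(1)[0][0]


def merge_pair(tokens, pair):
    """Replace every non-overlapping occurrence of `pair` with one merged token."""
    merged = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == pair:
            merged.append(tokens[i] + tokens[i + 1])
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged


# One BPE-style step: merge the most frequent pair of characters.
pair = most_frequent_pair(char_tokens)
subword_tokens = merge_pair(char_tokens, pair)

print(len(word_tokens), len(char_tokens), len(subword_tokens))
```

Repeating the merge step many times is what grows a subword vocabulary: sequences get shorter than the character-level version while rare words can still be spelled out from smaller pieces.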

Reproduce GPT2 (124M) by Andrej Karpathy 1 Micrograd

Follow the GitHub page by Dr. Karpathy, https://github.com/karpathy/build-nanogpt, and the YouTube video, https://www.youtube.com/watch?v=l8pRSuU81PU&list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ&t=1175s. The whole series is worth a deep dive, so here it is, right from the very beginning. First, the basics: in the first great video, micrograd, Andrej illustrates how to apply backward propagation manually at the micro level, with just two simple Python files, engine.py and nn.py, …
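
As a rough idea of what micro-level backpropagation looks like, here is a stripped-down sketch in the spirit of micrograd. It is a simplification written for this page, not a copy of engine.py: it supports only addition, multiplication, and tanh on scalar values, and the example numbers are arbitrary.

```python
import math


class Value:
    """A scalar that remembers how it was computed so gradients can flow back."""

    def __init__(self, data, _children=()):
        self.data = data
        self.grad = 0.0
        self._backward = lambda: None  # set by the operation that creates this node
        self._prev = set(_children)

    def __add__(self, other):
        out = Value(self.data + other.data, (self, other))

        def _backward():
            # d(a + b)/da = 1 and d(a + b)/db = 1
            self.grad += out.grad
            other.grad += out.grad

        out._backward = _backward
        return out

    def __mul__(self, other):
        out = Value(self.data * other.data, (self, other))

        def _backward():
            # d(a * b)/da = b and d(a * b)/db = a
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad

        out._backward = _backward
        return out

    def tanh(self):
        t = math.tanh(self.data)
        out = Value(t, (self,))

        def _backward():
            # d tanh(x)/dx = 1 - tanh(x)^2
            self.grad += (1.0 - t * t) * out.grad

        out._backward = _backward
        return out

    def backward(self):
        # Topologically sort the graph, then apply the chain rule in reverse.
        topo, visited = [], set()

        def build(v):
            if v not in visited:
                visited.add(v)
                for child in v._prev:
                    build(child)
                topo.append(v)

        build(self)
        self.grad = 1.0
        for v in reversed(topo):
            v._backward()


# A single neuron: out = tanh(x * w + b)
x, w, b = Value(0.5), Value(-1.0), Value(0.25)
out = (x * w + b).tanh()
out.backward()
print(out.data, x.grad, w.grad, b.grad)
```

Each operation records its own local derivative, and `backward()` walks the graph from the output to the inputs, multiplying those local derivatives together. That is the chain rule applied one node at a time.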

Decompose Scipy Linprog.py

""" A top-level linear programming interface. .. versionadded:: 0.15.0 Functions --------- .. autosummary:: :toctree: generated/ linprog linprog_verbose_callback linprog_terse_callback """ import numpy as np from ._optimize import OptimizeResult, OptimizeWarning from warnings import warn from ._linprog_highs import _linprog_highs from ._linprog_ip import _linprog_ip from ._linprog_simplex import _linprog_simplex from ._linprog_rs import _linprog_rs from ._linprog_doc import (_linprog_highs_doc, _linprog_ip_doc, # noqa: …

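For readers who have not used the interface that this module exposes, here is a small usage sketch of `scipy.optimize.linprog`. The problem data are invented for illustration, and the call assumes a SciPy version that provides the `"highs"` method (1.6 or later).

```python
from scipy.optimize import linprog

# Toy problem (made up for illustration):
#   maximize   x + 2y
#   subject to x + y <= 4, 0 <= x <= 3, 0 <= y <= 3
# linprog minimizes, so the objective is negated.
c = [-1.0, -2.0]
A_ub = [[1.0, 1.0]]
b_ub = [4.0]
bounds = [(0, 3), (0, 3)]

res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")

print(res.success)  # whether the solver reports an optimal solution
print(res.x)        # optimal values of x and y
print(-res.fun)     # maximized objective (sign flipped back)
```

The result is an `OptimizeResult`, the same class imported at the top of linprog.py, with the solution in `res.x` and the minimized objective in `res.fun`.
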
Genesis and Isaac Platforms

According to Perplexity, Genesis and Isaac are both platforms designed for robotics and AI development, but they have distinct features and focus areas. Genesis Platform. Simulation Speed: Genesis can run simulations at up to 43 million frames per second, which is 430,000 times faster than real time. Physics Engine: Integrates various state-of-the-art physics solvers into a unified …
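
The two speed figures quoted above can be checked against each other with a line of arithmetic. The sketch below only divides the quoted numbers; the variable names are made up, and the resulting baseline is an inference from those figures rather than something the excerpt states.

```python
# Figures quoted in the excerpt above.
simulated_fps = 43_000_000    # frames per second claimed for Genesis
speedup_vs_real_time = 430_000

# Frame rate that "real time" would have to mean for both numbers to agree.
implied_real_time_fps = simulated_fps / speedup_vs_real_time
print(implied_real_time_fps)
```

Under these figures, "real time" appears to correspond to a simulation stepping at 100 frames per second.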