Build your own deep learning framework based on C/C++, CUDA, and Python, with GPU support and automatic differentiation
For many years I have been using PyTorch to build and train deep learning models. Even though I have learned its syntax and rules, something has always aroused my curiosity: what is happening internally during these operations? How does all of this work?
If you have gotten here, you probably have the same questions. If I ask you how to create and train a model in PyTorch, you will probably come up with something like the code below:
import torch
import torch.nn as nn
import torch.optim as optim

class MyModel(nn.Module):
    def __init__(self):
        super(MyModel, self).__init__()
        self.fc1 = nn.Linear(1, 10)
        self.sigmoid = nn.Sigmoid()
        self.fc2 = nn.Linear(10, 1)

    def forward(self, x):
        out = self.fc1(x)
        out = self.sigmoid(out)
        out = self.fc2(out)
        return out

...

model = MyModel().to(device)
criterion = nn.MSELoss()
optimizer = optim.SGD(model.parameters(), lr=0.001)

for epoch in range(epochs):
    for x, y in ...
        x = x.to(device)
        y = y.to(device)

        # forward pass and loss computation
        outputs = model(x)
        loss = criterion(outputs, y)

        # backpropagation and parameter update
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
But what if I ask you how this backward step works? Or, for instance, what happens when you reshape a tensor? Is the data rearranged internally? How does that happen? Why is PyTorch so fast? How does PyTorch handle GPU operations? These are the kinds of questions that have always intrigued me, and I imagine they intrigue you as well. So, in order to better understand these concepts, what is better than building your own tensor library from scratch? And that is what you will learn in this article!
In order to build a tensor library, the first concept you need to learn clearly is: what is a tensor?
You may have an intuitive idea that a tensor is a mathematical concept of an n-dimensional data structure that contains some numbers. But here we need to understand how to model this data structure from a computational perspective. We can think of a tensor as consisting of the data itself and also some metadata describing aspects of the tensor, such as its shape or the device it lives on (i.e. CPU memory, GPU memory...).
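As a minimal sketch of that idea, here is a plain Python container holding the data buffer plus its metadata (the field names are my own illustration, not PyTorch's internal layout):

from dataclasses import dataclass
from typing import List

@dataclass
class Tensor:
    """Illustrative sketch: a tensor is a flat buffer plus metadata."""
    data: List[float]    # the raw values, stored as one contiguous 1-D array
    shape: List[int]     # logical dimensions, e.g. [4, 8]
    strides: List[int]   # elements to skip per dimension (explained next)
    device: str          # where the buffer lives, e.g. "cpu" or "cuda"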
There is also a less popular piece of metadata that you may never have heard of, called stride. This concept is very important for understanding the internals of tensor data rearrangement, so we need to discuss it a little more.
Imagine a 2-D tensor with shape (4, 8).
The data (i.e. the float numbers) of a tensor is actually stored as a 1-dimensional array in memory.
So, in order to represent this 1-dimensional array as an N-dimensional tensor, we use strides. Basically the idea is the following:
We have a matrix with 4 rows and 8 columns. Considering that all of its elements are arranged row by row in the 1-dimensional array, if we want to access the value at position (2, 3), we need to traverse 2 rows (of 8 elements each) plus 3 positions. In mathematical terms, we need to traverse 3 + 2 * 8 elements in the 1-dimensional array.
So this ‘8’ is the stride of the first dimension. In this case, it is the information of how many elements we need to traverse in the array to “jump” to the next position along the first dimension, i.e. to the next row.
Thus, to access the element (i, j) of a 2-dimensional tensor with shape (shape_0, shape_1), we basically need to access the element at position j + i * shape_1 of the 1-dimensional array.
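As a quick sanity check (a toy example of my own, not library code), we can flatten a 4×8 matrix row by row and confirm that the formula recovers the right element:

# Toy example: a 4x8 matrix flattened row by row into a 1-D list,
# where the value stored at (row, col) is row * 8 + col
shape_0, shape_1 = 4, 8
flat = [row * shape_1 + col for row in range(shape_0) for col in range(shape_1)]

i, j = 2, 3
# element (i, j) lives at position j + i * shape_1 in the flat array
print(flat[j + i * shape_1])  # 19, i.e. the value stored at row 2, column 3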
Now, let us imagine a three-dimensional tensor of shape (5, 4, 8).
You can think of this three-dimensional tensor as a sequence of matrices. For example, you can think of this (5, 4, 8) tensor as 5 matrices of shape (4, 8).
Now, in order to access the element at position (1, 3, 7), you need to traverse 1 complete matrix of shape (4, 8), 3 rows of 8 elements each, and 7 more elements. So, you need to traverse (1 * 4 * 8) + (3 * 8) + (7 * 1) positions in the 1-dimensional array.
Thus, to access the element (i, j, k) of a 3-D tensor with shape (shape_0, shape_1, shape_2) in the 1-D data array, you access the position k + j * shape_2 + i * shape_1 * shape_2.
This shape_1 * shape_2 is the stride of the first dimension, shape_2 is the stride of the second dimension, and 1 is the stride of the third dimension.
Then, in order to generalize: to access the element (i_0, i_1, ..., i_(n-1)) of an N-dimensional tensor, you access the position i_0 * stride_0 + i_1 * stride_1 + ... + i_(n-1) * stride_(n-1) of the 1-dimensional array.
Here the stride of each dimension can be calculated as the product of the shapes of all the following dimensions: stride_k = shape_(k+1) * shape_(k+2) * ... * shape_(n-1).
Then we set stride_(n-1) = 1.
On our example tensor of shape (5, 4, 8), we would have strides = (4*8, 8, 1) = (32, 8, 1).
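Here is a small sketch of that rule (my own helper functions, not part of any library), computing strides from a shape and using them to find the flat position of an N-dimensional index:

def compute_strides(shape):
    # the stride of the last dimension is 1; each earlier stride is the
    # product of all the shapes that come after it
    strides = [1] * len(shape)
    for dim in range(len(shape) - 2, -1, -1):
        strides[dim] = strides[dim + 1] * shape[dim + 1]
    return strides

def flat_index(index, strides):
    # position in the 1-D array = sum of index_k * stride_k over all dimensions
    return sum(i * s for i, s in zip(index, strides))

print(compute_strides([5, 4, 8]))         # [32, 8, 1]
print(flat_index([1, 3, 7], [32, 8, 1]))  # 32 + 24 + 7 = 63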
You can test it on your own:
import torch

torch.rand((5, 4, 8)).stride()
# (32, 8, 1)
Okay, but why do we need shapes and strides? Beyond accessing elements of N-dimensional tensors stored as 1-dimensional arrays, this concept can also be used to manipulate tensor arrangements very easily.
For example, to reshape a tensor, you only need to set the new shape and calculate the new strides based on it (since the new shape guarantees the same number of elements)!
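A minimal sketch of that idea (an illustration under the assumption of contiguous, row-major storage, not how PyTorch actually implements reshape) could look like this:

import math

def compute_strides(shape):
    # same helper as sketched earlier: the stride of the last dimension is 1
    strides = [1] * len(shape)
    for dim in range(len(shape) - 2, -1, -1):
        strides[dim] = strides[dim + 1] * shape[dim + 1]
    return strides

def reshape_metadata(shape, new_shape):
    # the reshape must preserve the total number of elements
    assert math.prod(shape) == math.prod(new_shape)
    # the flat data buffer is untouched; only shape and strides change
    # (this simple recomputation assumes the data is stored contiguously)
    return new_shape, compute_strides(new_shape)

print(reshape_metadata([5, 4, 8], [4, 5, 2, 2, 2]))
# ([4, 5, 2, 2, 2], [40, 8, 4, 2, 1])

You can see the same metadata reported by PyTorch: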
import torch

t = torch.rand((5, 4, 8))

print(t.shape)
# (5, 4, 8)
print(t.stride())
# (32, 8, 1)

new_t = t.reshape((4, 5, 2, 2, 2))

print(new_t.shape)
# (4, 5, 2, 2, 2)
print(new_t.stride())
# (40, 8, 4, 2, 1)
Internally, the tensor is still stored as the same 1-dimensional array. The reshape method did not change the order of the elements within the array! That's amazing, isn't it? 😁
You can verify it on your own using the following function, which accesses the internal 1-dimensional array of a PyTorch tensor:
import ctypes

def print_internal(t: torch.Tensor):
    # read the raw bytes behind the tensor and view them as a flat 1-D tensor
    print(
        torch.frombuffer(
            ctypes.string_at(t.data_ptr(), t.storage().nbytes()), dtype=t.dtype
        )
    )

print_internal(t)
print_internal(new_t)
# both calls print the same flat sequence of values
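As a further quick check (using the t and new_t tensors from the example above, and assuming the reshape returned a view, which it does here since t is contiguous), both tensors point to the very same buffer:

# the reshape did not copy the data: both tensors share one storage
print(t.data_ptr() == new_t.data_ptr())  # True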