IAML Blog

Home Blog

Dal k-NN al Transformer in pochi passi (quasi)

25, Jan Simone Scardapane

Uno degli articoli scientifici più influenti dell'ultima decade è sicuramente Attention is All You Need.¹ Come da titolo, l'obiettivo dell'articolo era semplice: mostrare come una componente delle reti neurali fino a quel momento di nicchia (neural attention, o semplicemente attenzione in questo post) bastava da sola a costruire architetture neurali estremamente sofisticate. La famiglia di modelli così ottenuti, i Transfor…

Continue Reading...

Search the blog

Categories

Tutorials (16)

Discussions (12)

Announcements (4)

Tutorials (English) (4)

Articles' summaries (3)

Discussions (English) (2)

Focus-on (1)

Reviews (1)

Discussion (1)

Archives

January 2021 (1)

August 2020 (1)

June 2020 (1)

April 2020 (2)

February 2020 (1)

December 2019 (1)

September 2019 (1)

June 2019 (1)

May 2019 (1)

April 2019 (1)

March 2019 (4)

February 2019 (3)

Popular Tags

deep learning (11)

pytorch (9)

reti neurali (5)

google (4)

jit (4)

tensorflow (4)

ottimizzazione (4)

rete neurale (3)

time series (3)

keras (3)

reti convolutive (3)

pipeline (2)

sklearn (2)

autodiff (2)

automatic differentation (2)

reverse-mode (2)

derivate (2)

differenziazione (2)

model selection (2)

cross validation (2)

c++ (2)

numpy (2)

vmap (2)

caffe (2)

compiler (2)

jax (2)

codemotion (1)

bias (1)

discrimination (1)

fairness (1)

iaml (1)

database (1)

iperparametri (1)

autograph (1)

head (1)

multi-task (1)

learning (1)

novità (1)

dev summit (1)

custom estimator (1)

hyperopt (1)

goodfellow (1)

nlp (1)

dati mancanti (1)

transformer (1)

attenzione (1)

robocop (1)

yolo (1)

object detection (1)

bayes (1)

autoencoders (1)

variational (1)

eager (1)

imputazione (1)

CIFAR (1)

word embedding (1)

MNIST (1)

immagini (1)

classificazione (1)

kpi (1)

reprogramming (1)

adversarial (1)

browser (1)

javascript (1)

reti ricorsive (1)

reti ricorrenti (1)

ftth (1)

adversarial example (1)

management (1)

robotica (1)

ocr (1)

focus (1)

iphone (1)

python (1)

face id (1)

momento (1)

adam (1)

neuroscienza (1)

onde cerebrali (1)

torchvision (1)

latin (1)

pretrained (1)

rete convolutiva (1)

autograd (1)

swish (1)

attivazione (1)

checkpoint (1)

tensori (1)

variabili (1)

lineare (1)

regressione (1)

convolutional networks (1)

Vatican (1)

project (1)

kernel (1)

ICLR (1)

ipotesi (1)

sparsità (1)

funzionale (1)

functional (1)

adversarial attack (1)

kmeans (1)

analysis (1)

clustering (1)

Google (1)

regression (1)

JAX (1)

gaussian process (1)

ensemble (1)

boosting (1)

gradient (1)

semi-supervised learning (1)

document classification (1)

graphs (1)

variables (1)

linear (1)

k-NN (1)

Syndicate
Atom 1.0 RSS

About

Italian Association for Machine Learning (C.F. 97949550582)

Write us: info@iaml.it

Address

Operational office IAML c/o Pi Campus, via Indonesia 23, 00144 Rome Legal office Via Cassia 964, 00189, Rome

Quick Links

Home

Documents (Italian)

Copyright 2018 IAML.IT. All Rights Reserved.

Powered by Grav

Theme (adapted) from Deliver