张驰浩 (Postdoc): Towards Understanding the Terminal Phase of Training of Deep Neural Networks




Academy of Mathematics and Systems Science, CAS
Colloquia & Seminars

Speaker: 张驰浩 (Postdoc), The University of Tokyo, Japan
Inviter: 张世华
Title:
Towards Understanding the Terminal Phase of Training of Deep Neural Networks
Time & Venue:
2021.10.28 08:00-08:40 S525
Abstract:
Modern practice for training classification deepnets involves a Terminal Phase of Training (TPT), which begins at the epoch where the training error first vanishes. During TPT, the training error stays effectively zero while the training loss is pushed towards zero. Vardan Papyan et al. characterize the TPT as Neural Collapse (NC), involving four deeply interconnected phenomena: (NC1) cross-example within-class variability of the last-layer training activations collapses to zero, as the individual activations themselves collapse to their class-means; (NC2) the class-means collapse to the vertices of a Simplex Equiangular Tight Frame (ETF); (NC3) up to rescaling, the last-layer classifiers collapse to the class-means, or in other words to the Simplex ETF, i.e., to a self-dual configuration; (NC4) for a given activation, the classifier's decision collapses to simply choosing whichever class has the closest training class-mean, i.e., the Nearest Class-Center (NCC) decision rule. However, the NC described by Papyan et al. concerns only the behavior of the last layer of deepnets; the behavior of the intermediate layers is still unclear. In this talk, I will briefly introduce the NC phenomena and discuss future directions towards understanding the TPT of deepnets by investigating the behavior of the intermediate layers.
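As a concrete illustration of these definitions, the sketch below (an illustrative implementation, not code from the talk; the function names are hypothetical) constructs the C vertices of the Simplex ETF from (NC2) and computes the within-class collapse statistic Tr(Sigma_W pinv(Sigma_B)) / C that Papyan et al. use to track (NC1), where Sigma_W and Sigma_B are the within-class and between-class covariances of the last-layer activations.

```python
import numpy as np

def simplex_etf(C):
    """Vertices of a standard Simplex ETF in R^C (hypothetical helper).

    Returns a C x C matrix whose columns have unit norm and pairwise
    inner product -1/(C-1): the configuration that, per (NC2), the
    class-means approach during TPT (rank C-1, up to rotation).
    """
    return np.sqrt(C / (C - 1)) * (np.eye(C) - np.ones((C, C)) / C)

def nc1_metric(features, labels):
    """Within-class variability relative to between-class spread (NC1).

    features: (N, d) array of last-layer activations
    labels:   (N,) integer class labels in {0, ..., C-1}
    Returns Tr(Sigma_W @ pinv(Sigma_B)) / C, which tends to zero as
    activations collapse to their class-means.
    """
    classes = np.unique(labels)
    C = len(classes)
    N, d = features.shape
    global_mean = features.mean(axis=0)
    Sigma_W = np.zeros((d, d))  # within-class covariance
    Sigma_B = np.zeros((d, d))  # between-class covariance
    for c in classes:
        Xc = features[labels == c]
        mu_c = Xc.mean(axis=0)
        Sigma_W += (Xc - mu_c).T @ (Xc - mu_c) / N
        diff = (mu_c - global_mean)[:, None]
        Sigma_B += diff @ diff.T / C
    return np.trace(Sigma_W @ np.linalg.pinv(Sigma_B)) / C
```

If the abstract's description holds, tracking nc1_metric over the epochs of TPT should show it decaying towards zero, while the centered and normalized matrix of class-means approaches the configuration returned by simplex_etf.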