“Networks” result page : הפקולטה למדעי הנתונים וההחלטות

Understanding and Enhancing Deep Neural Networks with Automated Interpretability

Understanding and Enhancing Deep Neural Networks with Automated Interpretability

Abstract: Deep neural networks are becoming incredibly sophisticated; they can generate realistic images, engage in complex dialogues, analyze intricate data, and execute tasks that appear almost human-like. But how do such models achieve these abilities? In this talk, I will present a line of work that aims to explain the behaviors of deep neural networks.… Continue Reading Understanding and Enhancing Deep Neural Networks with Automated Interpretability

Continue Reading Understanding and Enhancing Deep Neural Networks with Automated Interpretability

Bayesian Persuasion in Networks: Divisibility and Network Irrelevance

Continue Reading Bayesian Persuasion in Networks: Divisibility and Network Irrelevance

Predicting and Analyzing High-Level Cognitive Traits Using Computational Multiplex Networks and Vector Representations

Continue Reading Predicting and Analyzing High-Level Cognitive Traits Using Computational Multiplex Networks and Vector Representations

The edge-averaging process on graphs with random initial opinions

Abstract: In several settings (e.g., sensor networks and social networks), nodes of a graph are equipped with initial opinions, and the goal is to estimate the average of these opinions using local operations. A natural algorithm to achieve this is the edge-averaging process, where edges are repeatedly selected at random (according to independent Poisson clocks)… Continue Reading The edge-averaging process on graphs with random initial opinions

Two Lenses on Deep Learning: Data Reconstruction and Transformer Structure – Job Talk

Abstract: Despite the remarkable success of modern deep learning, our theoretical understanding remains limited. Many fundamental questions about how these models learn, what they memorize, and what their architectures can express are still largely open. In this talk, I focus on two such questions that offer complementary perspectives on the behavior of modern networks. First, I examine how… Continue Reading Two Lenses on Deep Learning: Data Reconstruction and Transformer Structure – Job Talk

Interpreting the Inner Workings of Vision Models

Abstract: In this talk, I present an approach for interpreting the internal computation in deep vision models. I show that these interpretations can be used to detect model bugs and to improve the performance of pre-trained deep neural networks (e.g., reducing hallucinations from image captioners and detecting and removing spurious correlations in CLIP) without any… Continue Reading Interpreting the Inner Workings of Vision Models

Fundamentals of Aligning General-Purpose AI – Job Talk

Abstract: The field of artificial intelligence (AI) is undergoing a paradigm shift, moving from neural networks trained for narrowly defined tasks (e.g., image classification and machine translation) to general-purpose models such as ChatGPT. These models are trained at unprecedented scales to perform a wide range of tasks, from providing travel recommendations to solving Olympiad-level math problems.… Continue Reading Fundamentals of Aligning General-Purpose AI – Job Talk