Ayon Sen
Research Scientist
Meta
email: ayonsn at gmail dot com
About Me
I am a research scientist working at Meta. Before joining Meta, I completed my PhD at the Computer Sciences Department, University of Wisconsin-Madison, under the supervision of Xiaojin (Jerry) Zhu. My research interests lie in data mining, machine learning, and machine teaching. My PhD research focused on machine teaching, which is the inverse problem of machine learning. In this scenario, we assume that a teacher knows the learning goal and wants to design an optimal (e.g., smallest) training set for a particular learner. Machine teaching has applications in computer security, debugging, educational psychology, and other fields. More details about machine teaching can be found here.
Here is my resume.
Publications
- Martina Rau, Ayon Sen, Xiaojin Zhu: Using Machine Learning to Overcome the Expert Blind Spot for Perceptual Fluency Trainings. AIED 2019.
- Ayon Sen, Scott Alfeld, Xuezhou Zhang, Ara Vartanian, Yuzhe Ma, Xiaojin Zhu: Training Set Camouflage. GameSec 2018. [pdf]
- Ayon Sen, Purav Patel, Martina A. Rau, Blake Mason, Robert Nowak, Timothy T. Rogers, Xiaojin Zhu: Machine Beats Human at Sequencing Visuals for Perceptual-Fluency Practice. EDM 2018. [pdf]
- Ayon Sen, Purav Patel, Martina A. Rau, Blake Mason, Robert Nowak, Timothy T. Rogers, Xiaojin Zhu: For Teaching Perceptual Fluency, Machines Beat Human Experts. CogSci 2018.
- Ayon Sen, Sheeraz Ahmad: Learning Substitutes Relationship with Deep Ranking. AMLC 2018.
- Md. Mahbubul Hasan, A. S. M. Sohidull Islam, Ayon Sen: Palindromic Subsequence Automata and Longest Common Palindromic Subsequence. MCS 2017.
- Xuezhou Zhang, Hrag Ohannessian, Ayon Sen, Scott Alfeld, Xiaojin Zhu: Optimal Teaching for Online Perceptrons. NIPS Constructive Machine Learning Workshop 2016. [pdf]
- Ayon Sen, Md. Monirul Islam, Kazuyuki Murase, Xin Yao: Binarization With Boosting and Oversampling for Multiclass Classification. IEEE T. Cybernetics 2015.
- Ayon Sen, A. S. M. Sohidull Islam, Md. Yusuf Sarwar Uddin: MARQUES: Distributed multi-attribute range query solution using space filling curve on DTHs. NSysS 2015.
- Ayon Sen, Md. Monirul Islam, Kazuyuki Murase: An algorithmic framework based on the binarization approach for supervised and semi-supervised multiclass problems. IJCNN 2014.
- Ayon Sen, Tahmid-un-Nabi, Pritom Ahmed, Rezwana Reaz, A. S. M. Sohidull Islam, M. Sohel Rahman: A Modified Scatter Search Approach to Solve the DNA Fragment Assembly Problem. MIC 2013.
- Md. Mahbubul Hasan, A. S. M. Sohidull Islam, Ayon Sen: On Palindromic Sequence Automata and Applications. ICIAA 2013.
Experience
Research Scientist
Organization: Meta
Period: August 2020 - Now
Most of my projects revolve around integrity. In this domain, I have combined multimodal (e.g., text and image) content detection with behavioral signals to identify bad actors on the platform.
PhD Intern
Organization: Facebook
Period: May 2019 - Aug 2019
I worked on the Pages Integrity team on the Business Impersonation Detection problem. Impersonation is a serious problem that often acts as a gateway to a variety of other issues, such as privacy violations, data breaches, phishing attacks, scams, and misinformation, causing heavy damage to both users and Facebook. I focused on building the backend infrastructure for real-time impersonation detection based on text. In particular, I used different text matching techniques (including a Siamese DNN) that are robust to adversarial obfuscations. My efforts helped increase the coverage of impersonation detection by around 20% with improved precision.
Applied Scientist Intern
Organization: Amazon
Period: May 2017 - Aug 2017
I worked on the Core Machine Learning team on the substitutes recommendation problem. In this system, substitutes for products are suggested to customers. This is an integral problem at Amazon across several businesses. For my project, I designed a deep neural network-based model that does not use behavioral data (from customers) or hand-tuned features. Hence, the model is easily applicable to newer or unpopular products. The model learns an embedding for the products (from textual and image features) which can be used to find other substitutes. In particular, the model was trained using a triplet architecture, i.e., each input contained features for three products. Our experiments suggested that the model performed better than the production model being used at the time.
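As a rough illustration of what training on triplets can look like, the sketch below computes a margin-based triplet loss over three product embeddings. This is a generic textbook formulation, not the model described above: the Euclidean distance, the margin value, and the names `triplet_loss`, `anchor`, `substitute`, and `unrelated` are all assumptions made for the example.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Margin-based triplet loss (assumed form, for illustration only).

    The loss is zero once the anchor is closer to the positive item
    than to the negative item by at least `margin`.
    """
    gap = euclidean(anchor, positive) - euclidean(anchor, negative)
    return max(0.0, gap + margin)


# Hypothetical 2-D embeddings for three products.
anchor = [0.0, 0.0]
substitute = [0.0, 1.0]   # should end up near the anchor
unrelated = [3.0, 4.0]    # should end up far from the anchor

# Well-ordered triplet: the substitute is already much closer.
loss_good = triplet_loss(anchor, substitute, unrelated)

# Swapped triplet: the "positive" is far away, so the loss is large.
loss_bad = triplet_loss(anchor, unrelated, substitute)

print(loss_good, loss_bad)
```

Minimizing a loss of this kind pulls substitutes together in the embedding space and pushes unrelated products apart, so nearest-neighbor lookups in that space can then surface candidate substitutes.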
SDE Intern
Organization: Amazon
Period: May 2016 - Aug 2016
For my internship project, I worked with the Reading and Behavior Analytics team under Kindle Reader Engineering. My task was to design a system that makes it possible to run MapReduce jobs without writing any MapReduce code. The key of the job was fixed beforehand. The values for the job are specified in a plain text file using a simple format; both basic data types (long, double, string, etc.) and complex data types (objects and lists) are supported. The main advantages are:
- Writing a new MapReduce task without actually writing any code
- This saves a lot of development time
- Multiple sources (of different types like Sequence files and text files) and multiple destination types are supported
- Output is stored in JSON format, so an external table can be created on top of it to run complex queries
SDE Intern
Organization: Amazon
Period: May 2015 - Aug 2015
I worked for the Xray team under Kindle Reader Engineering. My project involved developing an end-to-end computer vision system that automatically trains and evaluates classifiers and uses the trained classifiers to classify images. The system was versatile enough to accommodate different types of classifiers.
Lecturer
Organization: Department of Computer Science, Bangladesh University of Engineering and Technology
Period: May 2012 - Aug 2014
Courses instructed: Object Oriented Programming Language, Software Development and Information System Design, Database, Computer Architecture
Education
- Master of Science (MSc), Computer Science and Engineering, Bangladesh University of Engineering and Technology, 2014
- Bachelor of Science (BSc), Computer Science and Engineering, Bangladesh University of Engineering and Technology, 2012