
CS365 Course Project

Billion Word Imputation

Guide: Prof. Amitabha Mukherjee

Group 20: Aayush Mudgal [12008], Shruti Bhargava [13671]

Problem Statement

Problem Description : https://www.kaggle.com/c/billion-word-imputation

Examples :

1. “Michael described Sarah to a at the shelter .”
◦ One word has been removed; the task is to fill the gap: “Michael described Sarah to a __________ at the shelter .”

2. “He added that people should not mess with mother nature , and let sharks be .”
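For illustration (not part of the original slides), here is a minimal sketch of how such test cases can be simulated from complete sentences, e.g. to build a validation set; the function name and sentence are hypothetical:

```python
import random

def make_imputation_example(sentence, rng=random.Random(0)):
    """Simulate the Kaggle task: drop one interior word from a
    tokenized sentence and remember its position."""
    tokens = sentence.split()
    position = rng.randrange(1, len(tokens) - 1)  # keep the gap interior
    removed = tokens.pop(position)
    return " ".join(tokens), position, removed

corrupted, position, word = make_imputation_example("I hit the tennis ball .")
print(corrupted)         # sentence with one word removed
print(position, word)    # ground truth for scoring a prediction
```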

Basic Approach

Two sub-problems: where is the missing word (location), and what is it (word)? A brute-force sketch follows the list below.

1. Language modelling using Word2Vec
2. Strengthening using an HMM / NLP parser
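The slides do not spell out how the two sub-problems are combined; the following is only an illustrative brute-force sketch, where score_fn stands for any sentence scorer (an n-gram model, a Word2Vec-based model, etc.) and vocabulary for the set of candidate words:

```python
def best_insertion(tokens, vocabulary, score_fn):
    """Brute-force search over (location, word): insert every candidate word
    at every gap and keep the pair whose completed sentence scores highest."""
    best_pos, best_word, best_score = None, None, float("-inf")
    for pos in range(1, len(tokens)):        # candidate locations for the gap
        for word in vocabulary:              # candidate words to impute
            candidate = tokens[:pos] + [word] + tokens[pos:]
            score = score_fn(candidate)
            if score > best_score:
                best_pos, best_word, best_score = pos, word, score
    return best_pos, best_word

# In practice the inner loop must be pruned (e.g. to the words the language
# model already ranks highly), since the vocabulary of this corpus is huge.
```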

Skip-Gram vs. N-Gram

• Data is sparse

• Example sentence: “I hit the tennis ball”

• Word-level trigrams: “I hit the”, “hit the tennis” and “the tennis ball”

• But skipping the word “tennis” yields an equally important trigram: “hit the ball” (see the sketch below)
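A minimal sketch (not from the slides) of enumerating k-skip-n-grams, showing how skipping words produces additional context triples such as “hit the ball”:

```python
from itertools import combinations

def skip_grams(tokens, n=3, k=1):
    """Enumerate k-skip-n-grams: n-grams whose tokens may span a window of
    n + k words, i.e. up to k words may be skipped inside each gram."""
    grams = set()
    for start in range(len(tokens) - n + 1):
        head, rest = tokens[start], tokens[start + 1:start + n + k]
        for combo in combinations(rest, n - 1):   # order-preserving choices
            grams.add((head,) + combo)
    return grams

print(skip_grams("I hit the tennis ball".split(), n=3, k=1))
# Contains the plain trigrams plus skip-trigrams such as
# ('hit', 'the', 'ball'), obtained by skipping 'tennis'.
```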

Words as Atomic Units

Distributed Representation

Word2vec by Mikolov et al. (2013)

Two architectures

1. Continuous Bag-of-Words (CBOW)
◦ Predict the word given the context

2. Skip-gram
◦ Predict the context given the word
◦ The training objective is to find word representations that are useful for predicting the surrounding words in a sentence or a document
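A minimal usage sketch, assuming the gensim library, version 4 or later (the slides do not name an implementation); in gensim's Word2Vec, sg=0 selects CBOW and sg=1 selects Skip-gram:

```python
from gensim.models import Word2Vec

# Toy corpus: an iterable of tokenized sentences (examples from the slides).
sentences = [
    "I hit the tennis ball".lower().split(),
    "he added that people should not mess with mother nature".lower().split(),
]

cbow      = Word2Vec(sentences, sg=0, vector_size=100, window=5, min_count=1)
skip_gram = Word2Vec(sentences, sg=1, vector_size=100, window=5, min_count=1)

print(skip_gram.wv["ball"][:5])   # learned vector representation of "ball"
```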

Skip Gram Method

Given a sequence of training words w_1, w_2, w_3, . . . , w_T, the objective of the Skip-gram model is to maximize the average log probability:

(1/T) Σ_{t=1}^{T} Σ_{-c ≤ j ≤ c, j ≠ 0} log p(w_{t+j} | w_t)

where c is the size of the training context (which can be a function of the center word w_t).
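The pairs whose log-probabilities are averaged can be made concrete with a small sketch (not from the slides):

```python
def skip_gram_pairs(tokens, c=2):
    """List the (center word w_t, context word w_{t+j}) pairs, |j| <= c and
    j != 0, that the Skip-gram objective tries to predict."""
    pairs = []
    for t, center in enumerate(tokens):
        for j in range(-c, c + 1):
            if j != 0 and 0 <= t + j < len(tokens):
                pairs.append((center, tokens[t + j]))
    return pairs

print(skip_gram_pairs("I hit the tennis ball".split(), c=2))
# e.g. ('the', 'I'), ('the', 'hit'), ('the', 'tennis'), ('the', 'ball'), ...
```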

Skip Gram Method

The basic Skip-gram formulation defines p(w_O | w_I) using the softmax function:

p(w_O | w_I) = exp(v'_{w_O}^T v_{w_I}) / Σ_{w=1}^{W} exp(v'_{w}^T v_{w_I})

where v_w and v'_w are the “input” and “output” vector representations of w, and W is the number of words in the vocabulary. This is IMPRACTICAL because the cost of computing ∇ log p(w_O | w_I) is proportional to W, which is often large (10^5–10^7 terms).
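A toy numpy sketch (not from the slides) of this softmax; it makes the cost argument concrete, since the denominator needs one dot product per vocabulary word:

```python
import numpy as np

def softmax_prob(w_out, w_in, V_in, V_out):
    """p(w_O | w_I): numerator uses the output vector of w_O and the input
    vector of w_I; the denominator sums scores over all W vocabulary words."""
    scores = V_out @ V_in[w_in]                 # W dot products
    exp_scores = np.exp(scores - scores.max())  # shift for numerical stability
    return exp_scores[w_out] / exp_scores.sum()

W, N = 10, 4                      # toy vocabulary and vector sizes
rng = np.random.default_rng(0)
V_in  = rng.normal(size=(W, N))   # "input" vectors v_w
V_out = rng.normal(size=(W, N))   # "output" vectors v'_w
print(softmax_prob(w_out=3, w_in=7, V_in=V_in, V_out=V_out))
```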

Sub-Sampling of Frequent Words

• The most frequent words (e.g., “in”, “the”, “a”) can easily occur hundreds of millions of times.

• Such words usually provide less information value than rare words.

• Example: observing the co-occurrence of “France” and “Paris” is much more informative than the frequent co-occurrence of “France” and “the”.

• The vector representations of frequent words do not change significantly after training on several million examples.
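The sub-sampling rule from Mikolov et al. (2013) can be sketched as follows (the threshold value is the paper's suggested order of magnitude, not something stated on the slides):

```python
import math

def discard_prob(word_count, total_count, t=1e-5):
    """Each occurrence of word w is discarded with probability
    1 - sqrt(t / f(w)), where f(w) is the word's relative frequency."""
    f = word_count / total_count
    return max(0.0, 1.0 - math.sqrt(t / f))

# Hypothetical counts: a very frequent word is aggressively discarded,
# a rare word is always kept.
print(discard_prob(50_000_000, 1_000_000_000))   # ~0.99 for a word like "the"
print(discard_prob(500,        1_000_000_000))   # 0.0 for a rare word
```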

Skip-Gram Model : Limitation

• Word representations are limited by their inability to represent idiomatic phrases that are not compositions of the individual words.

• For example, “Boston Globe” is a newspaper; its meaning is not a composition of the meanings of “Boston” and “Globe”.

Therefore, using vectors to represent whole phrases makes the Skip-gram model considerably more expressive.
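Mikolov et al. (2013) detect such phrases with a simple count-based score; a sketch, where the counts below are purely hypothetical:

```python
def phrase_score(bigram_count, count_a, count_b, delta=5):
    """score(a, b) = (count(a b) - delta) / (count(a) * count(b));
    bigrams scoring above a threshold are merged into one token,
    e.g. "Boston_Globe". delta discounts very infrequent bigrams."""
    return (bigram_count - delta) / (count_a * count_b)

# Hypothetical counts for illustration only.
print(phrase_score(bigram_count=900, count_a=1_200, count_b=1_000))      # "Boston Globe": high
print(phrase_score(bigram_count=50,  count_a=6_000_000, count_b=1_000))  # "the Globe": low
```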

Questions?

References

1. Mikolov, Tomas, et al. "Distributed representations of words and phrases and their compositionality." Advances in Neural Information Processing Systems. 2013.

2. Mnih, Andriy, and Koray Kavukcuoglu. "Learning word embeddings efficiently with noise-contrastive estimation." Advances in Neural Information Processing Systems. 2013.

3. Guthrie, David, Ben Allison, W. Liu, Louise Guthrie, and Yorick Wilks. "A Closer Look at Skip-gram Modelling." Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC-2006), Genoa, Italy. 2006.

4. Mikolov, Tomas, et al. "Efficient estimation of word representations in vector space." arXiv preprint arXiv:1301.3781 (2013).

Challenge Description and Data : https://www.kaggle.com/c/billion-word-imputation

Hidden Markov Models
1. States: Parts of Speech
2. Combine Word2Vec with HMM
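The slides only name the idea; one illustrative (and entirely hypothetical) way to use part-of-speech states is to estimate POS transition probabilities from tagged text and flag the least likely transition as a candidate gap location. nltk and its pos_tag tagger are assumptions, not something the slides specify:

```python
from collections import Counter, defaultdict
import nltk   # assumes nltk and its POS-tagger data are installed

def pos_transitions(tagged_sentences):
    """Estimate P(tag_i | tag_{i-1}), HMM-style transition probabilities
    over part-of-speech states, from already-tagged training sentences."""
    counts = defaultdict(Counter)
    for sent in tagged_sentences:
        tags = [tag for _, tag in sent]
        for prev, cur in zip(tags, tags[1:]):
            counts[prev][cur] += 1
    return {prev: {cur: n / sum(c.values()) for cur, n in c.items()}
            for prev, c in counts.items()}

def candidate_gap(sentence, transitions):
    """Return the token boundary whose POS transition is least likely,
    i.e. the most suspicious location for the missing word."""
    tags = [tag for _, tag in nltk.pos_tag(sentence.split())]
    probs = [transitions.get(p, {}).get(c, 1e-9) for p, c in zip(tags, tags[1:])]
    return probs.index(min(probs)) + 1
```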

Skip-Gram Method
◦ Vocabulary size is V
◦ Hidden layer size is N
◦ Input vector: a one-hot encoded vector, i.e. only one node of {x_1, ..., x_V} is 1 and the others are 0
◦ The weights between the input layer and the hidden layer are represented by a V×N matrix W

Skip-Gram Method
• h = W^T x = v_{w_I}, the vector representation of the input word w_I
• u_j = v'_{w_j}^T h is the score of the j-th word in the vocabulary, where v'_{w_j} is the j-th column of the hidden-to-output weight matrix W'
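A toy numpy forward pass (not from the slides) matching these definitions:

```python
import numpy as np

V, N = 10, 4                        # toy vocabulary and hidden-layer sizes
rng = np.random.default_rng(0)
W       = rng.normal(size=(V, N))   # V x N input->hidden matrix; row i is v_{w_i}
W_prime = rng.normal(size=(N, V))   # N x V hidden->output matrix; column j is v'_{w_j}

x = np.zeros(V); x[7] = 1.0         # one-hot input for the word with index 7

h = W.T @ x                         # h = W^T x = v_{w_I} (copies row 7 of W)
u = W_prime.T @ h                   # u_j = v'_{w_j}^T h, score of each word
y = np.exp(u - u.max()); y /= y.sum()   # softmax over the vocabulary
print(y.argmax(), y.max())
```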