Q4 2013 Research: Public Company use of social media for IR - Part 3: YouTube & SlideShare
IR 670 Youtube Search Optimization
description
Transcript of IR 670 Youtube Search Optimization
![Page 1: IR 670 Youtube Search Optimization](https://reader036.fdocuments.us/reader036/viewer/2022062518/568148d3550346895db5ee5a/html5/thumbnails/1.jpg)
IR 670 Youtube Search Optimization
Spring 2010 - Prof. Caverlee
Spencer Huang, Jue Yin, Chia-Chun Lin
• When performing a Youtube search, regardless login or not, does not take a Youtube user’s preference into consideration (apply to Google video search also).
• Even if there is such a tool, no idea what’s in the box.
![Page 2: IR 670 Youtube Search Optimization](https://reader036.fdocuments.us/reader036/viewer/2022062518/568148d3550346895db5ee5a/html5/thumbnails/2.jpg)
Notations
• VideoEntity(v) - (t in v.title) + (t in v.description) + (t
in v.category) + (t in v.tag)
• UserEntity(u) - (t in u.fav) + (t in u.upload) + (t in
u.subcrption u’) u’, where u’ are terms in u’.fav and u’.upload; u could have multiple u’(s) that u subscribes to.
• User would input Youtube Id and query (at least), which would enable the generation of above entities.
![Page 3: IR 670 Youtube Search Optimization](https://reader036.fdocuments.us/reader036/viewer/2022062518/568148d3550346895db5ee5a/html5/thumbnails/3.jpg)
Approach and Feature
• Reorder via Vector-Space with stop words and with result retaining
• Support features from original Youtube/Google video search.
• Add New features such as by author and able/disable restrict content.
Crawl UserEntity
from Youtube Id and
VideoEntity of query
Stop words removal,
construct tf-idf for U and V, cache U for
re-use
Cosine-Similarity and
reorder
Display optimized
result
![Page 4: IR 670 Youtube Search Optimization](https://reader036.fdocuments.us/reader036/viewer/2022062518/568148d3550346895db5ee5a/html5/thumbnails/4.jpg)
Results• Interface
Ex: Say my Youtube
Id has 2 favorite
videos….
![Page 5: IR 670 Youtube Search Optimization](https://reader036.fdocuments.us/reader036/viewer/2022062518/568148d3550346895db5ee5a/html5/thumbnails/5.jpg)
Results (cont.)
![Page 6: IR 670 Youtube Search Optimization](https://reader036.fdocuments.us/reader036/viewer/2022062518/568148d3550346895db5ee5a/html5/thumbnails/6.jpg)
Results (cont.)
![Page 7: IR 670 Youtube Search Optimization](https://reader036.fdocuments.us/reader036/viewer/2022062518/568148d3550346895db5ee5a/html5/thumbnails/7.jpg)
Evaluation and Analysis• Conduct user study, ask each user to query and to
rank results’ relevance as baseline. Compare Youtube’s order of relevance vs our Vector approach.
NDCG@50 NDCG@20 NDCG@100
0.1
0.2
0.3
0.4
0.5
0.6
0.7
NDCG
Youtube
Ours w/ subs
Ours w/o subs
![Page 8: IR 670 Youtube Search Optimization](https://reader036.fdocuments.us/reader036/viewer/2022062518/568148d3550346895db5ee5a/html5/thumbnails/8.jpg)
Evaluation and Analysis (cont.)
![Page 9: IR 670 Youtube Search Optimization](https://reader036.fdocuments.us/reader036/viewer/2022062518/568148d3550346895db5ee5a/html5/thumbnails/9.jpg)
Conclusion and Problems encountered
• The optimization effect is dependant on query term and profile size.
• Youtube caps quota for deployed GAE, Youtube would reject a burst of requests. No problem when running on local GAE.
![Page 10: IR 670 Youtube Search Optimization](https://reader036.fdocuments.us/reader036/viewer/2022062518/568148d3550346895db5ee5a/html5/thumbnails/10.jpg)
Possible To do Features and QA
• Support for Asian languages• Interface for feature descriptor: No need
of Youtube account.• Try it out: http://irytso.appspot.com/
End