Personalization versus Filter Bubble: The influence of personalization on the quality of search queries

With the growing amount of data on the Internet, the need for information retrieval systems, i.e. search engines, is inarguable. Moreover, to search such huge volumes of data efficiently, these search engines use numerous smart techniques and algorithms. Perhaps the most famous example is the PageRank algorithm, which quantifies the importance of a webpage. However, it […]
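To make the PageRank mention concrete, here is a minimal sketch of the textbook power-iteration form of the algorithm. The function name `pagerank`, the damping factor of 0.85, the fixed iteration count, and the toy link graph are all assumptions chosen for illustration; this is not how any production search engine is implemented, and it assumes every link target also appears as a key in the graph.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {page: [pages it links to]}.

    Assumes every link target also appears as a key of `links`.
    """
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}  # start from a uniform distribution
    for _ in range(iterations):
        # Rank held by pages without outgoing links is spread over all pages.
        dangling = sum(rank[p] for p in pages if not links[p])
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        # Each page passes an equal share of its rank to every page it links to.
        for p in pages:
            for q in links[p]:
                new_rank[q] += damping * rank[p] / len(links[p])
        rank = new_rank
    return rank


# Hypothetical toy web: A and C both link to B, and B links back to A.
toy_web = {"A": ["B"], "B": ["A"], "C": ["B"]}
ranks = pagerank(toy_web)
```

In this toy graph nothing links to C, so it ends up with the lowest score, while A and B, which link to each other, share most of the rank.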

What Will Happen If Dr. K. C Dies?

I am busy preparing in the hope of getting to go abroad, and I have not followed the news much lately. It turns out it has already been 19 days since the saintly old man went down. I have not heard my Facebook friends make much noise this time; perhaps they are busy somewhere, like me. My 68-year-old father and 55-year-old mother live in Sunsari, 700 km from Kathmandu. I hear that the cow at home has gone down […]

Introduction to Data warehouse, OLAP and OLTP | Slides

"A data warehouse is a relational database that is designed for query and analysis rather than for transaction processing. It usually contains historical data derived from transaction data, but it can include data from other sources." - Oracle Docs. These slides contain basic information about data warehouses along with a simple application. Feel free to edit […]

Speaker Recognition using Gaussian Mixture Model

These presentation slides contain an introduction to the Gaussian mixture model (GMM) and its application in speaker recognition. Below I have listed some wonderful references for GMM. References: D. A. Reynolds and R. C. Rose, “Robust Text-Independent Speaker Identification Using Gaussian Mixture Speaker Models”, IEEE Trans. on Speech and Audio Processing, vol. 3, no. 1, pp. 72-83, January 1995. http://en.wikipedia.org/wiki/Probability_density_function […]
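As a rough illustration of how a GMM can be used for speaker identification, the sketch below scores a sequence of feature frames against one GMM per speaker and picks the speaker whose model gives the highest total log-likelihood. Everything here is an assumption made for illustration: the function names, the one-dimensional features, and the hand-picked model parameters. A real system would typically use multi-dimensional acoustic features and models trained with expectation-maximization.

```python
import math


def gmm_log_likelihood(frames, weights, means, variances):
    """Total log-likelihood of 1-D feature frames under a Gaussian mixture."""
    total = 0.0
    for x in frames:
        # Mixture density: weighted sum of the Gaussian component densities.
        density = sum(
            w * math.exp(-((x - m) ** 2) / (2.0 * v)) / math.sqrt(2.0 * math.pi * v)
            for w, m, v in zip(weights, means, variances)
        )
        total += math.log(density)
    return total


def identify_speaker(frames, speaker_models):
    """Return the name of the speaker whose GMM best explains the frames."""
    return max(
        speaker_models,
        key=lambda name: gmm_log_likelihood(frames, *speaker_models[name]),
    )


# Hypothetical, hand-picked models: (weights, means, variances) per speaker.
speaker_models = {
    "speaker_a": ([0.5, 0.5], [-1.0, 1.0], [0.5, 0.5]),
    "speaker_b": ([0.3, 0.7], [4.0, 6.0], [1.0, 1.0]),
}
best = identify_speaker([5.1, 6.2, 4.4], speaker_models)
```

Working in the log domain turns the product of per-frame likelihoods into a sum, which avoids numerical underflow on long utterances.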

Running external Python libraries (like NLTK) with Hadoop Streaming

NLTK is a leading platform for building Python programs to work with human language data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as WordNet, along with a suite of text processing libraries for classification, tokenization, stemming, tagging, parsing, and semantic reasoning, wrappers for industrial-strength NLP libraries – NLTK Documentation Hadoop […]