Random commentary about Machine Learning, BigData, Spark, Deep Learning, C++, STL, Boost, Perl, Python, Algorithms, Problem Solving and Web Search
Wednesday, January 26, 2011
Urls and costs
You have a certain numbers of URLs N, grouped by domain. Each URL u_j has an associated weight w_j. You want to pick X URLs from N, but when you pick the theurl u_j in d_i then you need to pick all the urls for that domain. How can you pick exactly X urls and minimize the associated cost?