<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE internships PUBLIC "GROUP DTD" "http://dbweb.enst.fr/group.dtd">
<internships xmlns:h="http://www.w3.org/1999/xhtml">
 <internship>
    <title>PhD Position on Machine Learning for the Web Graph &amp; Social networks</title>
    <proposed_by>
      <internal_ref ref="michalis"/>
    </proposed_by>
    <length>3 years</length>
    <level>PhD</level>
    <abstract>
    <h:p>Applications are invited for a Ph.D. position, starting beginning of 2012,
in the context of the DIGITEO project LEVETONE focusing on advanced Web Mining research.
The project is joint among top French Universities in the context of the ParisTech alliance with the support
of local industrial partners. The project is run and supervised by <h:a href="http://www.lix.polytechnique.fr/~mvazirg/">Prof. M. Vazirgiannis</h:a>.</h:p>

<h:p>The Research area is “Machine Learning for the Web Graph &amp; Social networks”. More specifically the research is focused along the following axes:</h:p>
<h:ul>
<h:li>Advanced methods for graph mining in social networks</h:li>
<h:li>Real time personalization for mobile devices</h:li>
</h:ul>
<h:h3>Requirements</h:h3>
<h:p>Prospective applicants should have</h:p>
<h:ul>
<h:li>a B.Sc. and a Master degree in the following areas: Mathematics, Physics, Computer Science/Engineering</h:li>
<h:li>experience in mathematical programming and relevant tools</h:li>
<h:li>analytical skills and creative thinking with a hard working attitude</h:li>
<h:li>a very good command in English (proved by relevant internationally approved tests)</h:li>
</h:ul>
 <h:p><h:strong>Funding:</h:strong> Full funding for 3 years is available.</h:p>
  <h:h3>Applications</h:h3>
 <h:p>Interested graduate students should send <h:strong>by email</h:strong></h:p>
 <h:ul>
 <h:li>a cover letter including a brief presentation of their academic record, the motivation and the skills of the candidate</h:li>
 <h:li>a full CV</h:li>
 <h:li>a list of at least 2 academic/industrial references (names and contact information only, not the actual letters)</h:li>
 </h:ul>
 <h:p>The above should be e-mailed in a compressed file named after your surname (i.e., &lt;surname&gt;.rar) to <h:a href="mailto:mvazirg@lix.polytechnique.fr">Prof. M. Vazirgiannis</h:a>.</h:p>
<h:h3>Location</h:h3>
<h:p>This position is joint between LIX/École polytechnique and the INFRES department of Télécom ParisTech, members of the ParisTech alliance. See further details at <h:a href="http://www.lix.polytechnique.fr/~mvazirg/">Prof. M. Vazirgiannis's Web page</h:a>.</h:p>
</abstract>
</internship>
 <internship>
    <title>Post-doctoral position on Machine Learning for the Web Graph &amp; Social networks</title>
    <proposed_by>
      <internal_ref ref="michalis"/>
    </proposed_by>
    <length>1 year</length>
    <level>PostDoc</level>
    <abstract>
    <h:p>Applications are invited for a Post-doc position, available in the period 2011-2013,
in the context of the DIGITEO project LEVETONE focusing on advanced Web Mining research.
The project is joint among top French Universities in the context of the ParisTech alliance with the support
of local industrial partners. The project is run and supervised by <h:a href="http://www.lix.polytechnique.fr/~mvazirg/">Prof. M. Vazirgiannis</h:a>.</h:p>

<h:p>The Research area is “Machine Learning for the Web Graph &amp; Social networks”. More specifically the research is focused along the following axes:</h:p>
<h:ul>
<h:li>Advanced methods for graph mining in social networks</h:li>
<h:li>Real time personalization for mobile devices</h:li>
</h:ul>
<h:h3>Requirements</h:h3>
<h:p>Prospective applicants should have</h:p>
<h:ul>
<h:li>a recent Ph.D. degree in the following areas: Computer Science/Engineering, Computational Mathematics, Physics</h:li>
<h:li>experience in data management, mathematical programming and relevant tools</h:li>
<h:li>analytical skills and creative thinking with a hard working attitude</h:li>
<h:li>a sound publication record.</h:li>
</h:ul>
 <h:p><h:strong>Funding:</h:strong> Full funding for up to 12 months is available.</h:p>
 <h:h3>Applications</h:h3>
 <h:p>Interested graduate students should send <h:strong>by email</h:strong></h:p>
 <h:ul>
 <h:li>a cover letter including a brief presentation of their academic record, the motivation and the skills of the candidate</h:li>
 <h:li>a full CV</h:li>
 <h:li>a list of at least 2 academic/industrial references (names and contact information only, not the actual letters)</h:li>
 </h:ul>
 <h:p>The above should be e-mailed in a compressed file named after your surname (i.e., &lt;surname&gt;.rar) to <h:a href="mailto:mvazirg@lix.polytechnique.fr">Prof. M. Vazirgiannis</h:a>.</h:p>
<h:h3>Location</h:h3>
<h:p>This position is joint between LIX/École polytechnique and the INFRES department of Télécom ParisTech, members of the ParisTech alliance. See further details at <h:a href="http://www.lix.polytechnique.fr/~mvazirg/">Prof. M. Vazirgiannis's Web page</h:a>.</h:p>
</abstract>
</internship>
 <internship>
    <title>What make probabilistic data efficiently queriable?</title>
    <proposed_by>
      <internal_ref ref="pierre"/>
    </proposed_by>
    <length>6 months</length>
    <level>MSc</level>
    <abstract>
      Probabilistic databases are compact representations of probability distributions over regular
      databases. A number of models have been proposed for probabilistic
      data, both in the relational and the XML settings. Evaluating a Boolean query
      over a probabilistic database
      amounts to computing the probability that this Boolean query
      matches in the probability distribution. One crucial question is whether query evaluation remains
      tractable on probabilistic
      databases.
      A number of research works have looked at characteristics of
      queries that may make them
      tractable: thus, queries without self-joins are tractable over
      tuple-independent databases if
      and only if they are hierarchical, while tree-pattern queries
      with a single join are tractable
      if and only if they are equivalent to a join-free query.
      The objective of this internship is to take the problem from the
      other side: identifying
      classes of data for which queries are tractable. One direction is
      to look at bounding the
      treewidth of the data; another is to try ﬁnding join patterns
      that make querying easy.
    </abstract>
    <refs>
      <ref type="Description of the internship" href="webdam_msc_internship.pdf"/>
    </refs>
  </internship>
   <internship>
    <title>Social web exploration</title>
    <proposed_by>
      <internal_ref ref="georges"/>
      <internal_ref ref="pierre"/>
    </proposed_by>
    <length>2–6 months</length>
    <level>MSc or Engineering Student</level>
    <abstract>
     <h:p>The intern will be a part of the <h:a href="http://arcomem.eu/">ARCOMEM</h:a> project, a 3-year European project bringing together twelve private and public partners from seven different countries, in order to build an intelligent social web archiving tool. More specifically, he will join our team working on new approaches to web exploration. The main project the intern will be involved in concerns the exploration (crawling) of the social web.</h:p>
<h:p>On social networks, standard HTML crawling is not always possible and it is sometimes compulsory or more convenient to get the data from specific service calls (APIs). However, these APIs usually have restrictive limitations, in terms of the number of requests per hour. In the perspective of archiving social data, we want to address this challenging research problem: how to optimize the amount of data that can be accessed under specific API constraints? This will be the main research project the intern will work on.</h:p>
<h:p>Other options related to ARCOMEM are open too. We are developing a framework to simplify interactions with multiple APIs and we face interesting technical issues. There are also questions on the use of social data in the integrated project. Web crawling usually consists in recursively exploring all the different links extracted from web pages; for ARCOMEM, we are developing intelligent modules to prioritize web pages that are deemed interesting relatively to their content and context, in particular the social context.</h:p>
<h:p>We hope this gives you some ideas of the research and development challenges. The work on the crawler has already resulted in many exciting innovations and the many dimensions of the ARCOMEM project opens the possibility of various research opportunities.</h:p>
<h:h3>The ideal candidate</h:h3>
<h:p>We are very open-minded about applications, our only firm requirement is some previous programming experience. The main qualities we are looking for are: interest in computer science, motivation, innovative thinking, openness to collaboration, and proactive mindset. Candidates are expected to be proficient in English.</h:p>
    </abstract>
  </internship>
</internships>

