<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">

    <title type="text">Liwa : News</title>
    <subtitle type="text">News:</subtitle>
    <link rel="alternate" type="text/html" href="http://liwa-project.eu/index.php/news/" />
    <link rel="self" type="application/atom+xml" href="http://liwa-project.eu/index.php/site/atom/" />
    <updated>2011-04-22T12:53:32Z</updated>
    <rights>Copyright (c) 2011, Nathalie</rights>
    <generator uri="http://expressionengine.com/" version="1.6.4">ExpressionEngine</generator>
    <id>tag:liwa-project.eu,2011:04:22</id>


    <entry>
      <title>Press release</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2011:index.php/news/4.208</id>
      <published>2011-01-27T08:31:50Z</published>
      <updated>2011-03-10T16:29:51Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>Internet pages are like stars in the sky: They are uncountable many, and every day new appear. They bring new texts, information, and pictures into existence, some of which only exist in the Internet. But who is to decide which pages are worth to preserve? Libraries and archives are currently rather helpless to deal with such gigantic amounts of data. With the current state of technology, it is not feasible to review and select all material for archiving. But this is about to change.
<br />
Two European projects will help to preserve the digital cultural heritage of the World Wide Web. The &#8220;Archive Community Memories&#8221; project focuses on automatic selection of web-content that is socially relevant. A new archiving method will not specifically search for topics or events in the Web and rate their importance. To achieve this not only Web pages of organizations or companies are evaluated, but also private content like publicly accessible blogs or social networks like Facebook. Social networks can be very helpful in discovering important Web pages, as users will suggest such pages to their friends. By harnessing such and other information, the project will help to optimize and to speed-up the reviewing process of national libraries or archives.
<br />
The total size of the EU project is eight million Euro. L3S Research Center at Leibniz University Hannover receives one million euro, and leads the scientific management. The overall management is led by researchers from University of Sheffield. There are also several other partners involved in the project, like Yahoo!, Südwestrundfunk, and Deutsche Welle.
<br />
The project is a follow-up of &#8220;Living Web Archives&#8221;, in which researchers from L3S Research Center and other European partners have been working in the past years. The goal was to improve the quality of Web archives, especially regarding multi-media content, spam detection, as well as enabling the use of the archive for future generations.
</p> 
      ]]></content>
    </entry>

    <entry>
      <title>LiWA technologies released in Open Source</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2010:index.php/news/4.152</id>
      <published>2010-08-30T07:18:22Z</published>
      <updated>2011-03-10T16:29:23Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Archive Fidelity"
        scheme="http://liwa-project.eu/index.php/site/news/C5/"
        label="Archive Fidelity" />
      <category term="Spam Cleansing"
        scheme="http://liwa-project.eu/index.php/site/news/C4/"
        label="Spam Cleansing" />
      <category term="Temporal Coherence"
        scheme="http://liwa-project.eu/index.php/site/news/C6/"
        label="Temporal Coherence" />
      <category term="Semantic Evolution"
        scheme="http://liwa-project.eu/index.php/site/news/C7/"
        label="Semantic Evolution" />
      <category term="Social Web"
        scheme="http://liwa-project.eu/index.php/site/news/C8/"
        label="Social Web" />
      <category term="Rich Media"
        scheme="http://liwa-project.eu/index.php/site/news/C9/"
        label="Rich Media" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <content type="html"><![CDATA[
        <p>They are all grouped under the “liwa-technologies” project on Google code:
<br />
<a href="http://code.google.com/p/liwa-technologies/" target="new">http://code.google.com/p/liwa-technologies/</a>.
</p>
<p>
1° The Rich Media Capture Module - a plug-in dedicated to the capture of streaming video content:
<br />
<a href="http://code.google.com/p/liwa-technologies/source/browse/rich-media-capture" target="new">http://code.google.com/p/liwa-technologies/source/browse/rich-media-capture</a>
<br />
<a href="http://code.google.com/p/liwa-technologies/downloads/detail?name=rich-media-capture-plugin-1.0.jar" target="new">http://code.google.com/p/liwa-technologies/downloads/detail?name=rich-media-capture-plugin-1.0.jar</a>
</p>
<p>
2° The Temporal Coherence Analyser - a plug-in dedicated to the analysis of the temporal coherence of the archived Web content:
<br />
<a href="http://code.google.com/p/liwa-technologies/source/browse/temporal-coherence" target="new">http://code.google.com/p/liwa-technologies/source/browse/temporal-coherence</a>
</p>
<p>
3° The Spam Assessment Interface - a Web service that enables the quality assessment of the archived Web content:
<br />
<a href="http://code.google.com/p/liwa-technologies/source/browse/assessment-interface" target="new">http://code.google.com/p/liwa-technologies/source/browse/assessment-interface</a>
</p>
<p>
4° The Semantic Analizer - a component dedicated to the detection of terminology evolution:
<br />
<a href="http://code.google.com/p/liwa-technologies/source/browse/SemanticAnalyser" target="new" >http://code.google.com/p/liwa-technologies/source/browse/SemanticAnalyser</a>
<br />
<a href="http://code.google.com/p/liwa-technologies/downloads/detail?name=SemanticAnalyser-1.0.zip" target="new">http://code.google.com/p/liwa-technologies/downloads/detail?name=SemanticAnalyser-1.0.zip</a>
</p>
<p>
5° The Web Archive UI Framework - a client-side framework that helps creating User Interface helpers for Web archive browsing:
<br />
<a href="http://code.google.com/p/liwa-technologies/source/browse/web-archive-ui-framework" target="new">http://code.google.com/p/liwa-technologies/source/browse/web-archive-ui-framework</a>
</p>
<p>
<br></br>To learn more about each component, the Google project provides also a wiki space, giving a brief description of each module and the necessary steps for its deployment: <a href="http://code.google.com/p/liwa-technologies/w/list" target="new">http://code.google.com/p/liwa-technologies/w/list</a>
</p>
<p>
<br> </br>You are all welcome to download and try out the LiWA components. Your feedback and comments will be greatly appreciated, helping us to improve the documentation and the usability of the technologies.
<br />

</p> 
      ]]></content>
    </entry>

    <entry>
      <title>LiWA Evolution Tracking Module released</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2011:index.php/news/4.321</id>
      <published>2011-04-22T12:42:31Z</published>
      <updated>2011-04-22T12:53:32Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Semantic Evolution"
        scheme="http://liwa-project.eu/index.php/site/news/C7/"
        label="Semantic Evolution" />
      <content type="html"><![CDATA[
        <p>The LiWA Terminology Evolution Tracking Module is a java module for Word sense evolution tracking, released under the “liwa-technologies” project on Google code:
<br />
<a href="http://code.google.com/p/liwa-technologies/downloads/detail?name=LiWAEvoTracking.zip&amp;can=2&amp;q=">http://code.google.com/p/liwa-technologies/downloads/detail?name=LiWAEvoTracking.zip&amp;can=2&amp;q=</a>
</p> 
      ]]></content>
    </entry>

    <entry>
      <title>LiWA Third Newsletter published</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2011:index.php/news/4.317</id>
      <published>2011-04-07T15:36:53Z</published>
      <updated>2011-04-07T15:49:54Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Archive Fidelity"
        scheme="http://liwa-project.eu/index.php/site/news/C5/"
        label="Archive Fidelity" />
      <category term="Spam Cleansing"
        scheme="http://liwa-project.eu/index.php/site/news/C4/"
        label="Spam Cleansing" />
      <category term="Temporal Coherence"
        scheme="http://liwa-project.eu/index.php/site/news/C6/"
        label="Temporal Coherence" />
      <category term="Semantic Evolution"
        scheme="http://liwa-project.eu/index.php/site/news/C7/"
        label="Semantic Evolution" />
      <category term="Social Web"
        scheme="http://liwa-project.eu/index.php/site/news/C8/"
        label="Social Web" />
      <category term="Rich Media"
        scheme="http://liwa-project.eu/index.php/site/news/C9/"
        label="Rich Media" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
         
      ]]></content>
    </entry>

    <entry>
      <title>The SHARC framework for data quality in Web archiving</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2011:index.php/news/4.230</id>
      <published>2011-03-10T16:38:41Z</published>
      <updated>2011-03-10T16:38:42Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Archive Fidelity"
        scheme="http://liwa-project.eu/index.php/site/news/C5/"
        label="Archive Fidelity" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>The <a href="http://www.springerlink.com/content/1066-8888" title="The SHARC framework for data quality in Web archiving" target="new">download</a> is available to download via online first in the VLDB Journal.
</p> 
      ]]></content>
    </entry>

    <entry>
      <title>Web spam classification: a few features worth more</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2011:index.php/news/4.227</id>
      <published>2011-03-10T16:25:40Z</published>
      <updated>2011-03-10T16:25:41Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Spam Cleansing"
        scheme="http://liwa-project.eu/index.php/site/news/C4/"
        label="Spam Cleansing" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>In this <a href="http://liwa-project.eu/images/publications/airweb2011.pdf">paper</a> we investigate how much various classes of Web spam features, some requiring very high computational effort, add to the classification accuracy. We realize that advances in machine learning, an area that has received less attention in the adversarial IR community, yields more improvement than new features and result in low cost yet accurate spam filters.
</p> 
      ]]></content>
    </entry>

    <entry>
      <title>Temporal Analysis for Web Spam Detection: An Overview</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2011:index.php/news/4.228</id>
      <published>2011-03-10T16:25:28Z</published>
      <updated>2011-03-10T16:26:29Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Temporal Coherence"
        scheme="http://liwa-project.eu/index.php/site/news/C6/"
        label="Temporal Coherence" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>In this <a href="http://liwa-project.eu/images/publications/twaw.pdf">paper</a> we give a comprehensive overview of temporal features devised for Web spam detection providing measurements for different feature sets.
</p> 
      ]]></content>
    </entry>

    <entry>
      <title>Language Evolution On The Go</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2011:index.php/news/4.226</id>
      <published>2011-03-10T16:24:46Z</published>
      <updated>2011-03-10T16:24:47Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Semantic Evolution"
        scheme="http://liwa-project.eu/index.php/site/news/C7/"
        label="Semantic Evolution" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
         
      ]]></content>
    </entry>

    <entry>
      <title>On the Applicability of Word Sense Discrimination on 201 Years of Modern English</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2011:index.php/news/4.225</id>
      <published>2011-03-10T16:22:56Z</published>
      <updated>2011-03-10T16:23:57Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Semantic Evolution"
        scheme="http://liwa-project.eu/index.php/site/news/C7/"
        label="Semantic Evolution" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>Word sense discrimination is the first, important step towards automatic detection of language evolution within large, historic document collections. By comparing the found word senses over time, we can reveal and use important information that will improve understanding and accessibility of a digital archive. Algorithms for word sense discrimination have been developed while keeping today’s language in mind and have thus been evaluated on well selected, modern datasets. The quality of the word senses found in the discrimination step has a large impact on the detection of language evolution. Therefore, as a first step, we verify that word sense discrimination can successfully be applied to digitized historic documents and that the results correctly correspond to word senses. Because accessibility of digitized historic collections is influenced also by the quality of the optical character recognition (OCR), as a second step we investigate the effects of OCR errors on word sense discrimination results. All evaluations in this <a href="http://liwa-project.eu/images/publications/jcdl2010-tahmasebi4.pdf">paper</a> are performed on The Times Archive, a collection of newspaper articles from 1785 - 1985.
</p> 
      ]]></content>
    </entry>

    <entry>
      <title>Talk at IWAW 2010</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2011:index.php/news/4.212</id>
      <published>2011-02-01T09:14:47Z</published>
      <updated>2011-02-01T09:17:48Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Rich Media"
        scheme="http://liwa-project.eu/index.php/site/news/C9/"
        label="Rich Media" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>In this presentation three use cases were presented:
</p>
<p>
    * preserve Dutch public broadcasting websites (preservation of Dutch cultural heritage)
<br />
    * collect Internet AV materials (mainly AV content that is broadcasted on the internet but not on traditional media)
<br />
    * preserve web context (to be used by archivists for looking up relevant context information for annotating Radio &amp; Television items)
</p> 
      ]]></content>
    </entry>

    <entry>
      <title>2nd LiWA Terminology Evolution Evaluation Workshop</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2010:index.php/news/4.205</id>
      <published>2010-12-29T10:31:11Z</published>
      <updated>2011-03-10T16:22:12Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Semantic Evolution"
        scheme="http://liwa-project.eu/index.php/site/news/C7/"
        label="Semantic Evolution" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>On December 15, 2010 the 2nd LiWA Terminology Evolution Evaluation Workshop will be held in Hanover, Germany at L3S Research Center. The workshop aims at evaluating terminology evolution found inside long term archives. The workshop attendees will also evaluate the performance of the Terminology Evolution Browser, a tool developed within LiWA to better visualize evolution.&nbsp;
</p> 
      ]]></content>
    </entry>

    <entry>
      <title>1st LiWA Terminology Evolution Evaluation Workshop</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2010:index.php/news/4.204</id>
      <published>2010-12-29T10:30:32Z</published>
      <updated>2010-12-29T10:32:33Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Semantic Evolution"
        scheme="http://liwa-project.eu/index.php/site/news/C7/"
        label="Semantic Evolution" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>The 1st LiWA Terminology Evolution Evaluation Workshop was held on March 16, 2010 at L3S Research Center, Hannover, Germany. The workshop spanned half a day and aimed at evaluating the outcome in LiWA WP5 technology.&nbsp;
</p> 
      ]]></content>
    </entry>

    <entry>
      <title>LiWA development mentioned at FIAT 2010</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2010:index.php/news/4.203</id>
      <published>2010-12-01T15:02:54Z</published>
      <updated>2010-12-01T15:20:55Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Archive Fidelity"
        scheme="http://liwa-project.eu/index.php/site/news/C5/"
        label="Archive Fidelity" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>This <a href="http://liwa-project.eu/images/videos/ATN_poster.png" onclick="window.open('http://liwa-project.eu/images/videos/ATN_poster.png','popup','width=605,height=852,scrollbars=no,resizable=yes,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false" target="new">poster</a> focused on:
<br />
- an example of a shared platform <a href="http://www.archivethe.net/en/" title="AtN" target="new">Archivethe.net</a> dedicated to heritage institutions
<br />
- archiving web video, its main issues and developments
</p> 
      ]]></content>
    </entry>

    <entry>
      <title>LiWA toolkit presented to IIPC Harvesting Working Group</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2010:index.php/news/4.202</id>
      <published>2010-11-12T09:17:18Z</published>
      <updated>2010-11-12T09:25:19Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>Radu Pop presented the LiWA tools released in open-source, during the Harvesting Working Group session of the <a href="http://netpreserve.org/events/vienna.php" target="new">IIPC meetings</a>.
<br />

</p> 
      ]]></content>
    </entry>

    <entry>
      <title>Language Evolution On The Go</title>
      <link rel="alternate" type="text/html" href="{url_title_path=news" />
      <id>tag:liwa-project.eu,2010:index.php/news/4.199</id>
      <published>2010-11-04T11:51:46Z</published>
      <updated>2010-11-04T11:52:47Z</updated>
      <author>
            <name>Nathalie</name>
            <email>nathalie@europarchive.org</email>
                  </author>

      <category term="Semantic Evolution"
        scheme="http://liwa-project.eu/index.php/site/news/C7/"
        label="Semantic Evolution" />
      <category term="General"
        scheme="http://liwa-project.eu/index.php/site/news/C22/"
        label="General" />
      <category term="Events"
        scheme="http://liwa-project.eu/index.php/site/news/C23/"
        label="Events" />
      <content type="html"><![CDATA[
        <p>Knowing about the evolution of a term can significantly decrease time needed for searching for information. It can also aid in quickly getting a broader overview, which is essential when one is on the move. In this <a href="http://liwa-project.eu/images/publications/LanguageEvolutionOnTheGo.pdf" title="LanguageEvolutionOnTheGo" target="new">paper</a> we present a solution for providing language evolution knowledge &#8220;on the go&#8221;. On the 3rd International Workshop on <a href="http://webhotel2.tut.fi/emmi/forum/node/56 " title="SAME 2010" target="new">Semantic Ambient Media Experience</a> 2010, November 10th in conjunction with <a href="http://www.ami-10.org/" title="AmI-10" target="new">AmI-10</a> in Malaga, Spain, the LiWA project will present a mobile interface for easy access and visualization as well as an overview of how this evolution was found.
</p> 
      ]]></content>
    </entry>


</feed>
