Categories related to this article:




Jun

5

This article is a free writing from the last keynote speaker at the ESWC 2007, Ron Brachman, from Yahoo. I write any ideas during the presentation, which reader might find not complete or structured. Anyway.. enjoy this note.

Knowledge Representation (KR) is a very important ingredient for the semantic web movement. And in the Semweb, KR meets the Web. Semweb can be said as “addding logic to the Web.”
Compare cialis levivia viagra
Order tramadol cod
Phentermine success story
Pharmacies ship phentermine c.o.d method
Phentermine pill
Phentermine and carbs
Phentermine 15 mgs
Eon phentermine
Cod xanax
37.5mg phentermine
Pfizer xanax information
Buy phentermine diet pill
Tramadol drug
Phentermine and birth defects
50 mg tramadol
How does viagra work
Ash of soma
Buy viagra internet
Cialis in the uk
Cheap phentermine 37.5 mg diet pills
Phentermine sale site top
Filing income tax buy tramadol
No prescription phentermine free shipping
Tramadol hcl 50 mg
Canada generic viagra
Discount xanax
Lowest price viagra
Buy phentermine no prescription
Tramadol hydrochloride capsules
Phentermine in stock ready to ship saturday delivery
Inform your doctor medication phentermine dose weight
Cialis side effects
Grapefruit xanax
Side effects of xanax mylan
Order xanax online
Phentermine lortab online
Phentermine blogging
Diet hcl phentermine pill
Generic india viagra
Hydrocodone com
How much weight will you lose on phentermine
Cheap viagra canada
Discount phentermine prescription
Withdrawal from xanax
No fax payday loan buy tramadol
Xanax and pregnancy
Why phentermine
Phentermine faqs
Cialis prescriptions
Phentermine 37.5mg tablets
Order phentermine online and cod shipping
Drug phentermine testing
Cheap overnight phentermine
Cialis generic canada
Luxury hotel rome xanax description
Cheap pharmacy viagra
Tramadol withdrawal
Purchase viagra
Generic viagra uk
On line viagra
Low cost cialis
Phentermine at cost with no prescription
Difference between viagra and levivia
Blue xanax
Vicodin addiction
Time released phentermine
Generic money order viagra
Buying phentermine online
Pulmonary hypertension and viagra
Viagra jokes
Phentermine buy order
Phentermine yellow 30 mg
Adipex diet discount phentermine pill
Is phentermine addictive
Free viagra canada
Phentermine florida
Lisinopril drug interaction viagra

Ron’s thought is that Semantic Web is now transforming the industry. He explained why Yahoo needs the SW, why KR is great, and how we can get profit wildly. However, major web company seem to take different direction.

We should tanks to Wayne Gretzky’s father speak: “don’t skate to where the puck is, but rather to where it will be.” So, not about where we are now or where we have been, but where the puck will be. The discussion is then will be lead to the ideas for future.

The idea behind Web 2.0 are making a huge impact in the industry (see Mark Greaves’ article Marc/April IEEE Intelligent System).

In Yahoo, they focuse on the underlying knowledge needed to move ahead (the science behind Web n.0). Ron thinks there are opportunities here.

The Internet

What is the Internet? According to Wikipedia, we get technical and very dry definition of the Internet. But this is not what most citizen would like to tell you. According to techies Internet is a distributed, heterogenous repository of tools, data, and people — all kind of people. It is a user centered view, where user can have many functionalities to find, use, share, etc “stuff” in the Internet. Stuff here can be any data, knowledge, help, things, people, etc.

Some example of user needs are search/find, communicaton through email, socializing/sharing, buying, selling, matchmaking, experiencing media, etc. A key question for compay like Yahoo is: how to satisfy the needs? NOw, internet is already far far away from ftp, telnet, gopher, etc. Now, we have instance messeging, search engine, e-commerce, social media, networked games, ads, podcasting, blogging, wikis,  voip, etc.

There are some important concepts to watch for. Fragmentation: everyting are composed from various fragments. Relationships: linkages of all sorts are critical things.
User-generated: sell, help, to be help, contact, blog, are strong drivers compared to the need of companies. Social media/”wisdom of crows”. Rich media: putting tv, radio, video, etc on the Internet. CPC/CPM/CTR: is very very important. Cost per click, click through rate.. now become very very important for a large number of companies or bussinesses which rely on the Internet.

The biggest problem is still: how to find stuffs. It is finding and answering problem. What we really care is not searching, but finding stuff and people that matched. Very significant problem: how do you know that you find stuff that you really need? How to know that two things are really different or  the same? This is a problem of Object identity. Problems of integration, mash up, retrofit, synthesis. How do we deal with putting all things together? Lightweight: most people don’t care wheter it is semantic-based or whatever, but care about how they get information, and they can use it. The company and system should be lightweight in order to people can use it easily.

World has changed, from TCP/header,DNS, TELNET, dll to one where the worthwords are things like “mushup”. Thus we need to move from the “Old” computer science. Institution like ACM tagsonomize what they do into hardware, software, data, computer, etc. The study and curriculum also still address their learning following the taxonomy.

We need a new science of the Internet. From that “old” CS to the new one, so the graduated students are ready to joint company like Yahoo to face the problem in the Internet. Some areas:

* “Finding” science and systems
* Community science and system, which drive the Web 2.0
* Algorithmic advertising
* Computational micro-economics. Collaboration between CS and economics. The impact is so greate
* Media experiences
* Data science. Wall Mart dealing with 70 Terrabytes of data per day. How to deal with this?

We need to bring together existing disciplines and nurture the creating of new ones.

Ron bring a high level view of some new areas of thinking, key issues for consideration, and research challenges.

Toward a Science of Finding (moving beyond “search”)

It is more than just classical IR such as structure, index, and context. We should move to an information supply model, from query-based retrieval to context-driven supply. There are huge number of people which like to help you to find what you want: social search. Allow poeople, not search engine.

There is an evolution of commercial web search engine. In the first generation, we use only “on page”, text data such as word frequency and language. In the second generation, we use off-page, web-specific data such as link analysis, click-through data, and anchor-text (how people refer to tis page). The third generation answers “the need behind the queries”. There are semantic analysis (try to understand the meaning of the web), integrate multiple source of data, understanding tthe context (spacial, geography, religion, etc), help the user using spell checker, query refinement, etc; and integration of search and text analysis.

There are potential interesting opportunitis for SW technology for the first two items in the third generation. Also difficult challenges.

User needs results not documents. 60% of the search need are informational, what to learn e.g. definition of hemoglobin. 25% want to go to a page or navigational. Such as we already know the australian air line, but we use google to naviage to to that page. Other need is transactional, want to do something, e.g. buy a device (mediated by web), about 35%. Of course there are gray areas such as need of finding hub, and knowing what’s happen there.

The third generation search focus on user need and answer it directly using text analysis, semantic analysis. Either use hard or soft (partial) mathces, etc. For example, search for britney spears. This could be name of person, headline, company worked with her, etc. Yahoo knows this. So it gives ordinary search results also interesting shortcuts such as buy CDs, get pothos, etc. Searching hotel, also provide the availability of hotel rates. Searching person names and cities, usually need phone numbers and detail address.

Challenges of the third generation of search engine: answer user needs. For example, a search “sea food san francisco”. We can find the category in our data is restaurant, and then give shortcut/suggestion related to this category. We can apply machine learnig to classify and extract structures from users’ context, content, etc. Problem: the data are not always pre-classified.

From IR to Information Supply model

Information driven by user query, and the systrem provide information they need. Another example, after we book a flight, we are suggested to rent a car. In microsoft word, recommend “you seem to be writring a bussiness letter..” etc.

The following notes are only key ideas. Ron’s gave the presentation for last pages so quick because no more time for him.

Technical challenges: representation of context, information, and users. How we match in the right context.

The power of social media: flickr. People tags thousand of photos. The problem: tags are uncontrollable and organic. “the wisdom of crowds”.

We need community science and system works. myspace, flickr, etc. HOw we support people there?

There are four types of communities: social network, knowledge social search, enthusiast/affinity/music, marketplaces.

Nice example: DBLife Portal, extract information created by members of groups.

Book: the wisdom of crowds

Advertising algorithms. Problems in this area:
*match ads to query/context/user
*order the ads
*pricing on a click-through
*ambiguity

What this all about?
*For large open web companies, they move things strongly toward “web 2.0″ applications. Relation and social element are very important.
*Huge and growing among of content is user produced.
*dont forget ads: huge matching and inference problems
* new science are evolving

Toward a science of aggregated organic knowledge
* extraction to integration/rationalization
* wrapper induction
..

Lets skate to where th puck is going to be
Yahoo mission: connect peopleto their passions, their communities, and the world’s knowledge.



You might be interested with these links:




Comments

Comments are accepted in English, Bahasa Indonesia, or Boso Jowo.

Name (required)

Email (required)

Website

Speak your mind

2 Comments so far

  1. ibnu on July 17, 2007 7:02 am

    masih bingung ma penggunaan WS jika dibanding web biasa, apa penggunaan WS dipakai hanya pada pencarian teks saja??


International Events

My Delicious - More

I often get interesting new info from ads. And you?:



Complete list of selected blogs and websites

Please go to the bottom of this page.

Recent Comments

For Beginners (like me)

Bloggers & Portals

Standard Bodies & Interest Groups

Ontologies

Libraries

News and Journals