 |
|
| Computers Forum Index » Computer Artificial Intelligence » Some help in research..!... |
|
Page 1 of 1 |
|
| Author |
Message |
| vaib... |
Posted: Wed Oct 28, 2009 5:00 pm |
|
|
|
Guest
|
Hi all,
How have you all been doing ? It's been a while since I wrote here.
I have done a project in Single Document Extractive Text Summarization
recently but I am facing a problem. I do not have the DUC standard
documents to test my application. Can anyone point me (or send me) to
the right place ? Thanx.
Ok. Another issue. Since I got involved in AI and liked it very much I
want to more research in it (probably it'll also help me get into a
good college for my MS/Ph.D ). This time again we are two people and
we are trying to decide what area to work on in AI. I would highly
appreciate if you help us with our task of deciding the domain. We
want to work in NLP/IR. We would want to work in an area that has not
been explored much. Please keep in mind that we are just the beginners
who would not be able to do 'extremely' complex stuff. Here are some
areas we have segregated.
1. Indexers ( in search engines ).
2. Review Systems ( employing opinion mining and give out reviews )
3. Semantic web in mobile or any intelligent system for mobiles ( it
could be anything - please share if you have some knowledge/ideas ).
4. Ontology generation from text ( we have heard a lot about it
although we have no idea as to what it is )
5. Information Extraction.
6. Fact extraction from text
7. Tags and folksonomies.
8. Proof, trust and provenance for web information Applications.
9. Intelligent information retrieval
10. Intelligent user interfaces (for Web systems or otherwise)
11. Recognizing web spam
12. Recommendation systems
13. Trend spotting
14. Web link-analysis & graph mining
15. Word Sense Disambiguation.
16. Emotion Analysis ( seems very interesting but can anything
concrete be done here ? )
17. Research in Automated Planning
18. Belief Systems
I know those are a lot of topics but we need suggestions. Your
suggestions. Please don't restrict yourself to only the topics
mentioned above. If you have some exciting, out of the box ( or
'odd' ) topic in mind - kindly share. We are basically looking to do
research in some 'not-so-explored' field.
Thanking in anticipation.
- Vaibhav
[ comp.ai is moderated ... your article may take a while to appear. ] |
|
|
| Back to top |
|
|
|
| Ted Dunning... |
Posted: Thu Nov 05, 2009 5:27 pm |
|
|
|
Guest
|
Vaibhav,
That is a very nice summary of a number of difficult, but very
interesting areas of research. Some of them are more oriented towards
systems-level research in that the problems have mostly to do with
implementation. Others are more limited by problems of conception and
algorithms.
Your summary is good enough to make me curious about who the "we" that
you mention are. Can you say more?
I have added a few comments.
On Oct 28, 6:00 am, vaib <vaibhavpang... at (no spam) gmail.com> wrote:
Quote: ...
1. Indexers ( in search engines ).
Some of the problems here are scaling the indexers and crawlers to web
scale with a small number of machines. Many people have worked on
these problems. See for instance the blixo project. Another major
problem has to do, not so much with indexing per se, but simply the
accumulation of all of the various kinds of data that you get about
web pages. What you need is a very large flexible table store with
many of the properties of a column-oriented database.
On the NLP research side, some of the problems include the issues
involved in seeing text in multiple ways at once without losing the
precision provided by exact matching. Thus, if you decide a word
might be compounded, how can you index it to be efficiently found as
both the compound and the phrase. Synonymy and stemming can be
treated similarly. There are also interesting issues of compression.
Most web scale indexers compress their indexes by simply ignoring many
occurrences of words and only retaining those that are from "higher-
quality" pages.
Finally, there is the question of how you can use more and different
data from anchor text and linking patterns. Can you use the terms
people use in queries in combination with their clicking patterns to
enhance your index?
Quote: 4. Ontology generation from text ( we have heard a lot about it
although we have no idea as to what it is )
You are in good company. The experts on the subject are quite clear
about what it is, but not necessarily clear about what it should be.
Quote: 7. Tags and folksonomies.
8. Proof, trust and provenance for web information Applications.
These two issues are very closely related and very, very important.
They are also related to the spam problem.
Quote: 9. Intelligent information retrieval
Here, I prefer to not put the intelligence in the retrieval system but
instead to reflect the intelligences of my users back at them.
Reflected intelligence is MUCH easier than artificial intelligence and
thus has much higher commercial value in many cases.
This is only very poorly done at present with many pretty much
negative results. A contribution here would be very interesting.
Quote: 14. Web link-analysis & graph mining
See the pregel paper. Open source implementations of pregel similar
to hadoop for map-reduce would be very useful for many people.
Quote: 15. Word Sense Disambiguation.
This is related to the indexing problem mentioned above. Strictly
disambiguating a word is much less useful than indexing the level of
information that you have (or don't have). In most interesting cases,
you can limit the ambiguity of a word, but not eliminate it. Your
indexing should not force you to commit to a single reading.
Quote: 16. Emotion Analysis ( seems very interesting but can anything
concrete be done here ? )
Yes. And many people have done interesting things. One of the most
interesting tidbits I have heard was the use of local vocabulary
measures as an indicator of the mood of the writer.
[ comp.ai is moderated ... your article may take a while to appear. ] |
|
|
| Back to top |
|
|
|
| vaib... |
Posted: Fri Nov 13, 2009 7:13 pm |
|
|
|
Guest
|
On Nov 5, 5:27 pm, Ted Dunning <ted.dunn... at (no spam) gmail.com> wrote:
Quote: Vaibhav,
That is a very nice summary of a number of difficult, but very
interesting areas of research. Some of them are more oriented towards
systems-level research in that the problems have mostly to do with
implementation. Others are more limited by problems of conception and
algorithms.
[...]
Oh I thought no one was going to respond here. Sir I'll get back to
you soon. I had come closer to making a decision but your response
makes me rethink now.
Thanks for responding.
Regards,
Vaibhav
[ comp.ai is moderated ... your article may take a while to appear. ] |
|
|
| Back to top |
|
|
|
|
|
All times are GMT
The time now is Sun Nov 29, 2009 8:43 am
|
|