Questions tagged [information-extraction]

Filter by
Sorted by
Tagged with
0
votes
2answers
45 views

Feature extraction definition

I have difficulty understanding the concept of feature extraction since there are two main ways to describe it. The first one refers to mapping the raw data into a vector in R^d or the translation of ...
0
votes
0answers
14 views

Extracting metrics from natural language

Imagine a text like The revenue amounted to mEUR 124 during the year while last year it resulted in mEUR 100. I want to use ML methods to extract two outputs like ...
0
votes
1answer
11 views

Reconstructing face from randomised embedding

It is fairly agreed in literature that from a given face-embedding (that is a vector of features values) it is possible, with a good amount of effort, to reconstruct the original face, (See here for ...
0
votes
0answers
11 views

Is pooling acceptable to evaluate information extraction?

When dealing with information extraction of unbalanced classes (e.g. the desired class has a prevalence of 0.5%), the required sample size for validation might be huge (thousands of cases and more), ...
1
vote
0answers
25 views

Is there a method to approximately predict a 3D curve given two plane view of the curve?

Let's say the original data contains three variables, so it is a list of (x,y,z). In the figure, the blue and red curves are the lists of (x,z1) and (y,z2), respectively. These two lists are obtained ...
0
votes
1answer
22 views

How to make recognition of the important document's attributes

We have a set of PDFs with the different types of documents from the various companies. The goal: to predict which of them contain some important attributes (for example, document number, customer ...
1
vote
2answers
32 views

Extracting Text Components from unstructured data

I'm trying to understand what types of techniques would be most applicable to the following type of problem. I'm trying to, given a webpage url that contains a recipe, separate the ingredient list ...
0
votes
1answer
29 views

Unsupervised answering for a predefined set of questions

I am working on a project to read up a text segment and find answers to a specific set of questions, in order to do some information extraction. I have a set of text corpus (each of about 3000 words),...
1
vote
1answer
121 views

What Machine Learning library / algorithm should I use to extract pre-defined features from email text?

I'm new to ML, so if it seems like I am making incorrect assumptions on how to go about doing something, please feel free to correct me. I would like to be able to pass in as training data many ...
0
votes
0answers
22 views

Is it better to do feature extraction and select the k most weighted features or do feature selection?

I have 315 data with 12076, I need to reduce the dimension and also select just 10 most important features. What is the best way? Is it better to do feature extraction and choose the features with ...
1
vote
0answers
760 views

Machine learning - Text extraction / pattern identification

I am new to machine learning, There will be set of paragraph and i need to extract postcode from each paragraph. (The postcode may not have the same pattern as they are obtained from different ...
4
votes
1answer
926 views

Is Machine Learning viable for Extracting product Information from webpages?

I have a task to extract product information from a certain set of websites for price analysis. The product group I'm trying to harvest data is well defined, I could easily provide a set with all the ...
3
votes
1answer
85 views

Guessing/extracting from the pool of partially right guesses

Lets assume that I have integer numbers from 1 to 50. I imagine any 5 unique numbers from 1 to 50. A computer can generate an array of 5 random numbers, as many times as it wants. Each time it ...
1
vote
3answers
117 views

Proof that perfect feature extraction doesn't exist?

Anyone is aware of any proofs that "perfect feature extraction doesn't exist" (on any domain, language, vision, etc). Either philosophical or mathematical, are fine. Update: or the inverse ...
1
vote
0answers
113 views

Which model for this information extraction problem?

I am trying to solve the following pattern recognition / information extraction problem. Assume I have a text where each token has been annotated by a single class among $K$ classes available (with a ...