How to get the distribution of a doc over topics (and topic over words) #3

RamtinYazdanian · 2018-03-23T12:54:02Z

Hello,

First of all, thanks for developing this for Python!

I have been looking at the code, and I cannot find a way to infer two things: the distribution of a document over the topics on its path from the root to the leaf (the parameter theta in the "Hierarchical Topic Models and the Nested Chinese Restaurant Process" paper), and the distribution of each topic over the words (the betas in the same paper).

For the second case, dividing each word's count at a node by the node's total word count should yield that topic's probabilities over the words. Is that the best approximation of those values, or is there a way to get a more accurate one?
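To make the two quantities concrete, here is a minimal sketch of the kind of estimate I have in mind. It assumes symmetric Dirichlet priors on both distributions and takes the posterior mean given the current counts. The function names, the `eta` and `alpha` arguments, and the input arrays are all hypothetical and are not taken from this library's API. With `eta=0`, the first function reduces to the plain count ratio described above.

```python
import numpy as np

def topic_word_distribution(word_counts, eta=0.1):
    """Smoothed estimate of one topic's distribution over the vocabulary.

    word_counts: per-word counts assigned to a single node (hypothetical input).
    eta: assumed symmetric Dirichlet smoothing parameter.
    """
    counts = np.asarray(word_counts, dtype=float)
    vocab_size = counts.size
    return (counts + eta) / (counts.sum() + eta * vocab_size)

def doc_level_distribution(level_assignments, num_levels, alpha=10.0):
    """Smoothed estimate of one document's distribution over the levels of its path.

    level_assignments: level index (0 = root) of each word in the document
    (hypothetical input).
    alpha: assumed symmetric Dirichlet smoothing parameter.
    """
    levels = np.asarray(level_assignments, dtype=int)
    counts = np.bincount(levels, minlength=num_levels).astype(float)
    return (counts + alpha) / (counts.sum() + alpha * num_levels)

# Toy example: a 3-word vocabulary and a 4-word document on a 3-level path.
beta_hat = topic_word_distribution([3, 1, 0], eta=1.0)
theta_hat = doc_level_distribution([0, 0, 1, 2], num_levels=3, alpha=1.0)
```

Both outputs sum to 1 by construction. Averaging these estimates over several Gibbs samples, rather than using a single sample, would presumably reduce the variance, but I have not verified that here.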

rana-alshaikh · 2018-11-14T21:44:24Z

I have the same question. Did you find an answer?

gauravkoradiya · 2020-01-10T05:36:25Z

Same question here.
