Jump to content

Still Working On The Top 300/500/1000 Thai Words One Must Know...


Recommended Posts

Posted

I've had a post in the wings for a long while and decided to finish it off within the next month (hopefully - it might be more than I'd bargained for).

To do that, I need the top 300 list (at least). And to do that, I need to get some questions answered.

So if any of you programming chaps or chapettes know the answers...

1) On and off I've been collecting Thai vocabulary lists from everywhere, in Thai script from Thai courses, existing lists, etc. They are in a spreadsheet individually as well as in one long list. In Excel, what is the macro for figuring out which words are repeated the most times?

2) I have the conversations, in Thai script, for both Pimsleurs and Assimil. Is there an easy way of extracting the vocabulary from each?

3) This one is still unsolved: Converting .pdb Files To Text, Thai vocabulary lists...

Ta in advance...

Posted

Thank you Artamus. I used excel macros years ago but too many to count.

I'll give that one a try.

Posted

1. You probably need VBA for this. Looping through each words in each sheet and creating a new list with with the number of appearance next to each word.

2. What format are they in?

Posted

VBA? Pimsleurs and Assimil are both in Word docs. The list is excel. I'm now getting rid of duplicates from the same courses (some have words listed two and three times, but with similar meanings).

Posted
VBA? Pimsleurs and Assimil are both in Word docs. The list is excel. I'm now getting rid of duplicates from the same courses (some have words listed two and three times, but with similar meanings).

VBA = visual basic for application. Works on all office program.

Extracting Thai vocab from a word document is tricky because there are no space between words in Thai. The only way I can see is to use VBA but still tricky.

Getting rid of duplicates in Excel list can also be done by VBA.

Posted

The duplicates in the excel list as via colour. Each of the different courses are separated out by colour because I didn't know what process I would need for this and thought better safe than sorry.

Posted
The duplicates in the excel list as via colour. Each of the different courses are separated out by colour because I didn't know what process I would need for this and thought better safe than sorry.

Nothing promised but if you can send me the list, I might be able to knock up a few lines of code to get it done.

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Recently Browsing   0 members

    • No registered users viewing this page.



×
×
  • Create New...