gnumed-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Gnumed-devel] Consumable substances


From: Jim Busser
Subject: Re: [Gnumed-devel] Consumable substances
Date: Sat, 03 Sep 2011 18:30:15 -0700

On 2011-09-03, at 4:51 PM, Jim Busser wrote:

> On 2011-09-03, at 9:02 AM, Jim Busser wrote:
> 
>> The data set provided by vbanait has been appreciated… it has supplied ~ 375 
>> distinct substances (out of a total of ~ 500 substance-strength 
>> combinations), without atc codes though.
>> 
>> A much larger set is achievable by combining Health Canada and FreeDiams 
>> resources:
>> 
>> - 1795 distinct substances (1232 distinct ATC codes )
>> - 6357 distinct substance / strength combinations (of which 3772 have an ATC 
>> code) available in Canada
>> - Canadian substance names (e.g. acetaminophen) could be translated to one 
>> of the available INN names (e.g. in english = paracetamol).
>> 
>> The file size would be just over a megabyte.
>> 
>> Is this anything wanted?
>> 
>> -- Jim
> 
> Scripts attached. With my internet connection and machine, they ran in less 
> than a minute.
> 
> The resulting INN csv files will contain ~4252 consumable substances having 
> ATCs and file sizes around 150K.
> 
> The resulting Canadian non-INN file is a little bigger (6355 records, ~ 240 
> Kb) because its names do not depend on having an ATC or INN, and therefore it 
> contains substances even when they lack an ATC.
> 
> -- Jim
> 
> 
> <fd2gm_drugs.sh><fd2gm_drugs.sql>


Unfortunately those scripts will need tweaking, on account of (in the case of 
the Canadian non-inn file) containing 4 rows with zero amounts, plus anything 
else I did not find yet.

-- Jim





reply via email to

[Prev in Thread] Current Thread [Next in Thread]