Skip to content

Commit cbe37c6

Browse files
mgrafungachchipre-commit-ci[bot]github-advanced-security[bot]
authored andcommitted
Staging hi tn (NVIDIA#271)
* Future Implementations for classes - Measure, Money, and Date (NVIDIA#258) * Future Implementations for classes - Measure, Money, and Date Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * Resolved the conflicts with mm_yyyy and date ranges and added the previously removed failing test cases. Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * removed the unused empty string implementation Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * minor fixes for the tagger files Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * reformatted decimal final graph Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * incorporated the suggestion for decimal graph Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Century implementations Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * Working on the yyyy format for the date class Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * reverted yyyy code Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * working on future implementations Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * working on improving the date class accuracy Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * added year prefix for the date class Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * working on the commma cases for date class Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * minor fixes Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * implemented mixed fractions Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * rectified the test case Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * working on quarterly measurements Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * reformatted the prefixes and suffixes for date tagger class Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * replaced text tag with era tag for the date class Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> * Removed the text tag reference from date class verbalizer Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> --------- Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> * update jenkins cache Signed-off-by: Mariana Graterol Fuenmayor <marianag@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Potential fix for code scanning alert no. 821: Unused local variable Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Mariana <47233618+mgrafu@users.noreply.github.com> --------- Signed-off-by: Namrata Gachchi <ngachchi@nvidia.com> Signed-off-by: Mariana Graterol Fuenmayor <marianag@nvidia.com> Signed-off-by: Mariana <47233618+mgrafu@users.noreply.github.com> Co-authored-by: Namrata Gachchi <ngachchi@nvidia.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: RajanPutty <rputty@nvidia.com>
1 parent 8eef95c commit cbe37c6

23 files changed

Lines changed: 333 additions & 69 deletions

File tree

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
सन्
2+
सन
3+
साल
Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
में
2+
का
3+
की
4+
के
5+
से
6+
तक
7+
ईस्वी
8+
शताब्दी
9+
दशक
10+
सदी
Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
ई. पू. ईसा पूर्व
2+
ई. ईसवी
Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,12 @@
1+
s सेकंड
2+
hr घंटा
3+
h घंटे
4+
min मिनट
5+
doz दर्जन
6+
yr साल
7+
yr वर्ष
8+
hp हॉर्सपॉवर
9+
d दिन
10+
month महीना
11+
months महीने
12+
हफ़्ते हफ़्ते

nemo_text_processing/text_normalization/hi/data/measure/unit.tsv

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -141,14 +141,16 @@ month महीना
141141
months महीने
142142
ct कैरेट
143143
pH पीएच
144+
km/h किलोमीटर प्रति घंटा
144145
km/hr किलोमीटर प्रति घंटा
145146
km/min किलोमीटर प्रति मिनट
147+
m/h मीटर प्रति घंटा
146148
m/hr मीटर प्रति घंटा
147149
mi/s मील प्रति सेकंड
150+
mi/h मील प्रति घंटा
148151
mi/hr मील प्रति घंटा
149152
mi/min मील प्रति मिनट
150153
₹/ac रुपए प्रति एकड़
151154
x बाई
152155
X बाई
153156
* बाई
154-
- से
Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,10 +1,9 @@
11
रुपए
2-
P पैसे
32
£ पाउंड
43
वॉन
54
$ डॉलर
65
लीरा
76
टका
87
¥ येन
98
नाइरा
10-
यूरो
9+
यूरो
Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,9 @@
1+
रुपए पैसे
2+
पाउंड पेंस
3+
वॉन जिओन
4+
डॉलर सेंट
5+
लीरा कुरस
6+
टका पैसे
7+
येन सेन
8+
नाइरा कोबो
9+
यूरो सेंट

nemo_text_processing/text_normalization/hi/data/numbers/teens_and_ties.tsv

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -79,12 +79,12 @@
7979
८८ अट्ठासी
8080
८९ नवासी
8181
९० नब्बे
82-
९१ इक्यानबे
83-
९२ बानबे
84-
९३ तिरानबे
85-
९४ चौरानबे
86-
९५ पंचानबे
87-
९६ छियानबे
88-
९७ सत्तानबे
89-
९८ अट्ठानबे
82+
९१ इक्यानबे
83+
९२ बानबे
84+
९३ तिरानबे
85+
९४ चौरानबे
86+
९५ पंचानबे
87+
९६ छियानबे
88+
९७ सत्तानबे
89+
९८ अट्ठानबे
9090
९९ निन्यानबे

nemo_text_processing/text_normalization/hi/data/time/hours.tsv

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,4 @@
1+
शून्य
12
एक
23
दो
34
तीन

nemo_text_processing/text_normalization/hi/taggers/cardinal.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -80,6 +80,7 @@ def create_larger_number_graph(digit_graph, suffix, zeros_counts, sub_graph):
8080
graph_ten_thousands |= create_larger_number_graph(teens_and_ties, suffix_thousands, 1, teens_ties)
8181
graph_ten_thousands |= create_larger_number_graph(teens_and_ties, suffix_thousands, 0, graph_hundreds)
8282
graph_ten_thousands.optimize()
83+
self.graph_ten_thousands = graph_ten_thousands
8384

8485
# Lakhs graph and ten lakhs graph
8586
suffix_lakhs = pynutil.insert(" लाख")
@@ -90,6 +91,7 @@ def create_larger_number_graph(digit_graph, suffix, zeros_counts, sub_graph):
9091
graph_lakhs |= create_larger_number_graph(digit, suffix_lakhs, 1, graph_thousands)
9192
graph_lakhs |= create_larger_number_graph(digit, suffix_lakhs, 0, graph_ten_thousands)
9293
graph_lakhs.optimize()
94+
self.graph_lakhs = graph_lakhs
9395

9496
graph_ten_lakhs = create_graph_suffix(teens_and_ties, suffix_lakhs, 5)
9597
graph_ten_lakhs |= create_larger_number_graph(teens_and_ties, suffix_lakhs, 4, digit)
@@ -98,6 +100,7 @@ def create_larger_number_graph(digit_graph, suffix, zeros_counts, sub_graph):
98100
graph_ten_lakhs |= create_larger_number_graph(teens_and_ties, suffix_lakhs, 1, graph_thousands)
99101
graph_ten_lakhs |= create_larger_number_graph(teens_and_ties, suffix_lakhs, 0, graph_ten_thousands)
100102
graph_ten_lakhs.optimize()
103+
self.graph_ten_lakhs = graph_ten_lakhs
101104

102105
# Crores graph ten crores graph
103106
suffix_crores = pynutil.insert(" करोड़")

0 commit comments

Comments
 (0)