Skip to content
Projects
Groups
Snippets
Help
This project
Loading...
Sign in / Register
Toggle navigation
B
BabelZoo
Overview
Overview
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
PLN
BabelZoo
Commits
ade166c2
Unverified
Commit
ade166c2
authored
5 years ago
by
PLN (Algolia)
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
feat(seeds): 1-2 words max
parent
f8c0fb0f
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
5 additions
and
6 deletions
+5
-6
loader.py
glossolalia/loader.py
+5
-6
No files found.
glossolalia/loader.py
View file @
ade166c2
import
os
import
string
from
pprint
import
pprint
from
random
import
choice
,
randint
...
...
@@ -37,21 +36,21 @@ def get_lines(filename):
return
all_lines
def
load_seeds
(
corpus
=
None
,
nb_seeds
=
10
):
def
load_seeds
(
corpus
=
None
,
nb_seeds
=
10
,
min_len
=
1
,
max_len
=
2
):
if
corpus
is
None
:
corpus
=
load_text
s
()
corpus
=
load_text
()
seeds
=
[]
for
i
in
range
(
nb_seeds
):
plain_lines
=
filter
(
lambda
k
:
k
!=
"
\n
"
,
corpus
)
plain_lines
=
filter
(
lambda
k
:
k
not
in
"
\n
"
and
len
(
k
)
>
2
,
corpus
)
chosen
=
choice
(
list
(
plain_lines
))
split
=
chosen
.
split
(
" "
)
nb_words
=
randint
(
1
,
len
(
split
))
nb_words
=
randint
(
min_len
,
min
(
max_len
,
len
(
split
)
))
seeds
.
append
(
" "
.
join
(
split
[:
nb_words
]))
return
seeds
def
main
():
lines
=
load_text
s
(
"../
"
)
lines
=
load_text
(
"../KoozDawa/data/genius.txt
"
)
print
(
"Some seeds:"
)
pprint
(
load_seeds
(
lines
))
...
...
This diff is collapsed.
Click to expand it.
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment