Multimodal Deep Learning (D4L4 Deep Learning for Speech and Language UPC 2017)
Deep Learning and Fontli
-
Upload
imaginea-technologies -
Category
Technology
-
view
367 -
download
0
Transcript of Deep Learning and Fontli
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
DEEP LEARNING & FONTLI
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
I would like to understand things better, but I don’t want to understand them perfectly.
- Douglas Hofstadter
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
In Fluid Concepts and Creative Analogies, Douglas Hofstadter gives centre stage to analogy making as a foundation for intelligence
Hofstadter talks about derivative projects – one of which (“Letter Spirit”) is about analogy making in the typography space : the question being “if <word1> : <insert-stylized-word1> then <other-letters> : ?”.
In a sentence, if I show the system a particular stylized presentation of the letters “abcd”, how do you get it to construct other letters such as “wxyz” in the same style.
Fontli is a social network for typoholics. Capture typography around you and share it. Let the typetalk develop on Fontli.
So we tried it with a bunch of images largely in three classes – type, patterns and semi-realistic pictures – with some surprisingly good and surprisingly crappy results.
TYPE PATTERNS SEMI-REALISTIC
We wanted to try what “taking the style from an image” would do to a dead simple bland piece of text in black and
white – the “content”, shown below.
The image which was passed to neural-style as the “content” image in all the cases.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
DEEPCONVOLUTION
NEURALNETWORK
(DCNN)
The layers of a deep convolution neural network (DCNN) that are closer to the input tend to look at spatially local features
The layers deeper down tend to be concerned with higher level and even conceptual features
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
The images were rendered using the 19-layer network which is the default in the neural-style tool
The amount of content has been kept small in these images deliberately – the proportion varying
between 5:1000 and 50:1000
Each image took about 10mins to generate on a g2.2xlarge instance on AWS
DCNN+
FONTLI
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE RESULTS …
A top favourite. The swirly strokes of the pattern have been beautifully woven to form the letters amidst the chaos of flowy colours.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE RESULTS …
The tree pattern has lots of whirlies all over, which appears to be the reason the whirlieshave been used to express the letter shaping. Note that the “n” appears in bold and somewhat like the trunk of the tree.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE RESULTS …
The colours and texture of the mat pieces in the style image have been used pretty effectively to create the letters. Again, the “content” is set to low in this case. We’ve also lost some of the regularity of the whole pieces in the style image, which we might consider to be the “content” of the style image.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE RESULTS …
The colour palette and the texture of the colouring has transposed nicely on to the result image as expected.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE RESULTS …
This is interesting because the vertical strokes of the tree bark have been preserved more or less intact, while locally altering the lines to reveal the letters. The notches in the style image have been used to make the characters. See how the lines converge towards the dot of the “i”.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE RESULTS …
One of those exceptional cases where something interesting came out of using a typography image as the style. That said, I had no idea such an image would result –i.e. the result in unpredictable.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE RESULTS …
Another favourite. The strokes of the “kolam” pattern and the colourshave been used beautifully to shape the letters.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE RESULTS …
The thorny pattern seems to have been used as the “pencil” to draw out the letters this time.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE HEAD-SCRATCHERS …
Realistic photographs did absolutely nothing.
Also, since the ImageNet training set doesn’t attempt to teach typography to the neural net, we
also didn’t have much success trying to use the letter styles from an image and apply it to new text.
The results were mixed, with decent results some times, but head-scratchers at other times.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE HEAD-SCRATCHERS …
Using real-ishphotographs does absolutely nothing artistically interesting to the text! This turned out to be the crappiest of what we tried :)
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
THE HEAD-SCRATCHERS …
We thought the sketchiness would transfer over to the lettering, but that didn’t happen. As noted, this area is a bit unpredictable. Maybe the sketching is quite far away from what we might find in photographs.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
LESSONS LEARNT
What we’ve seen in this quick experiment is that the net trained on real world photos does not quite have
the ability to read typography. This isn’t surprising since typography neither features strongly in the
collection nor have such images been labelled well. That said, we can indeed make some very interesting images that present lettering and we hope to include
this functionality in Fontli.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
Are you looking to solve complex problems using Deep Learning?Explore Deep Learning with Imaginea…
GET IN TOUCH WITH US >>
Pramati’s M&A’s of Leading products
Serving from 5 Global Locations
Over 200 Product Companies
Unique Products & Services
Agile Methodology
User-centric design
Open Source Contributions
Products built from conception-code-cash
Imaginea: Services Company with Product DNA
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.
Disclaimer
This document may contain forward-looking statements concerning products and strategies. These statements are based on management's current expectations and actual results may differ materially from those projected, as a result of certain risks, uncertainties and assumptions, including but not limited to: the growth of the markets addressed by our products and our customers' products, the demand for and market acceptance of our products; our ability to successfully compete in the markets in which we do business; our ability to successfully address the cost structure of our offerings; the ability to develop and implement new technologies and to obtain protection for the related intellectual property; and our ability to realize financial and strategic benefits of past and future transactions. These forward-looking statements are made only as of the date indicated, and the company disclaims any obligation to update or revise the information contained in any forward-looking statements, whether as a result of new information, future events or otherwise.
All Trademarks and other registered marks belong to their respective owners.
Copyright © 2012-2015, Imaginea Technologies, Inc. and/or its affiliates. All rights reserved.
Credits
Images under Creative Commons Zero license.
Private and confidential. Copyright (C) 2016, Imaginea Technologies Inc. All rights reserve.