57923

Creating a wordvector model combining words from other models

I have two different word vector models created using word2vec algorithm . Now issue i am facing is few words from first model is not there in second model . I want to create a third model from two different word vectors models where i can use word vectors from both models without loosing meaning and the context of word vectors.

Can I do this, and if so, how?

Answer1:

You could potentially translate the vectors for the words only in one model to the other model's coordinate space, using other shared words to learn a translation-function.

There's a facility to do this in recent gensim versions – see the TranslationMatrix tool. There's a demo Jupyter notebook included in the docs/notebooks directory, viewable online at:

https://github.com/RaRe-Technologies/gensim/blob/develop/docs/notebooks/translation_matrix.ipynb

You'd presumably take the larger model (or whichever one is thought to be better, perhaps because it was trained on more data), and translate the smaller number of words its missing into its space. You'd use as many common-reference 'anchor' words as is practical.

Recommend

  • “The remote server returned an error: NotFound” Translator API Windows phone 8
  • Better way to map from String constants to int constants in Java
  • How to move a drawable across the screen
  • How to choose the right UIGestureRecognizer?
  • Why do I get this error when I try to open an MDF database via path/filename in Entity Framework?
  • Cyrillic characters in MessageBox class of Sencha Touch
  • NSView leaves artifacts on another NSView when the first is moved across the second
  • Float to 24 Bit Integer in C# [duplicate]
  • Gettext working intermittently
  • Java Graphics2D Translate and Scale
  • Is there an advantage in using global variables instead of pointers?
  • configurable application texts
  • How to access a global variable which is hidden by a macro definition?
  • django.template.base.TemplateSyntaxError: 'block' tag with name 'bottom_js' appe
  • Web Front End For Perl Script [closed]
  • How to scroll window contents using Direct2D api?
  • li element backgrounds not obeying the border radius on transformations
  • Create properties in class from list / attribute in __init__
  • UIImageView loses quality after panning?
  • Unresolved reference to constructor which is defined [duplicate]
  • How to use jQuery to process json answer from a server
  • LNK2005, LNK1169 errors, “int __cdecl g(void)” (?g@@YAHXZ) already defined
  • steps in feature-based registration
  • Passing a model into RedirectToAction()
  • Translate Python to Lua: replace a string character in a list
  • Can we write a macro in many lines without the backslash at the end?
  • CSS Hover jitter issue
  • Why does declaration of the same name inside the same declarative region is denied?
  • How to add flags to the google web translator?
  • Visual Studio Debugger Watch problems
  • Automatic Java to C++ conversion [duplicate]
  • PHP gettext: not all strings becomes translated
  • In swift, is there a way to have UIView class also be MPMediaPickerControllerDelegate?
  • lua_getglobal crashing program
  • Android ContentResolver.query always returns same data
  • Passing milliseconds into stored procedure mySQL
  • Is inlining done at compile time or run-time?
  • How to translate std::array C++11 operations to Boost+VS08?
  • Maya 2018, python, move and rotate extracted face
  • Consuming WCF Service through https from Silverlight
  • Can I use Data Annotations or Fluent to convert a value between data store and entity class?
  • Selecting text and show meaning as pop up in android eclipse [closed]
  • Refinery CMS on Ruby Rails: Translating / Localising the Strings in the Blog Module
  • Questions on how compiled programs interact with the operating system
  • Overlaying 2D paths on UIImage without scaling artifacts
  • Rails: Calling .limit(5) changes order of results
  • String split and join get strings from database
  • Jackson processor for java bean to json\\xml serialization
  • Value of type char* cannot be used to initialize an entity of type “char”
  • References to incomplete types
  • Where does string-literal begin and end?
  • Generic Type vs Explicit Type Class/Struct
  • Using GitHub with another Bug-tracker? [closed]
  • Boost Spirit and abstract syntax tree design
  • Where's the GIL in PyPy?
  • angular-translate - Fallback language for determinePreferredLanguage()
  • CABasicAnimation back to its original position after it finish it's 1 cycle
  • Add translation to I18N dynamically
  • C# implementing a application wide cache
  • Reading language from source
  • Can't find file executable in your configured search path for INTEL C++ compiler (Linux)
  • colSums - shifted results
  • How do I fetch name of locale (language) not just a locale string, but rather the name of that langu
  • cosmos not returning any records when using table api
  • Wrong answer for code to convert between Java camel case and C++ underscore identifiers
  • React: Expected an assignment or function call and instead saw an expression
  • Move model forward relative to his rotation XNA
  • Can I use a “\\n” character in strings that are to be translated with Qt Linguist
  • String^ to LPCTSTR in VC++2010 (Windows form application)
  • Design of Generic Singleton Wrapper class
  • Using iText7 (Java) to add a table to an existing PDF and continue on additional pages
  • Ms Test or NUnit?
  • Obtaining room/area information from Forge api
  • How can I manipulate translation methods of cakephp 3 to become case insensitive?
  • Using extern to refer to the instantiation of a non-static inline function
  • Stop printing php error messages to browser
  • Bing translator exception while integrating with java application
  • display sql custom text from table column result
  • Django admin: how to edit two models with indirect relationship on the single admin view?
  • Split a PDF page in two parts [duplicate]
  • TextView's bounds issue
  • C# equivalent for these VB statements
  • Can I easily evaluate many IQueryables in a single database call using Entity Framework?
  • How to make image rotation and 3D transform simultaneously?
  • Django, how to create a fallback language in a multilingual website?
  • How to transform a FAB into a popup menu?
  • How can I create a random number of D2D shapes (rectangles and ellipses) and refer to them as an arr
  • Icons in a DataGridViewComboBoxColumn
  • Bird's eye view perspective transformation from camera calibration opencv python
  • In Three.js, how to translate a Vector3?
  • How to recode multiple columns in R
  • how to convert english into arabic dynamically
  • How to return value from service to controller in Angularjs
  • Three.js keyframe animation
  • What is adaptive spinning w.r.t lock acquisition?
  • How to concatenate two arrays?
  • gcc: command line args referenced differently in assembly code
  • sh.exe is preventing windows cmd move command from working
  • url rewriting an id with a string variable
  • How to update localizable.strings from app by code?
  • No definition available for static const member with initializer?
  • Is C compiler able to optimize across object file?
  • Translating a phrase using a dictionary
  • How to use switch case in Mustache template?
  • In GTK, how do I change the localized language text on the UI while it's running?
  • Can an empty virtual table exist?
  • Why are templates not redefined, why is it all written in the header file?
  • How to do this coordinate system operation more efficiently?
  • ractivejs component nesting
  • 3D libgdx rotation
  • Newly added fields not showing in Profile Edit form in Django Oscar?
  • Liquibase Data Type Mapping documentation
  • Translating query involving join table from CakePHP 1.3 to 3
  • Django Rest Framework custom field only pair of values (without fieldname as dict key)
  • database design for quiz with different languages
  • Compress and decompress XML in Flex and Java
  • Watch the locale and fetch articles again - Vuejs
  • Is there a version of Ruby's Regexp.match that responds to the order of the matches within the
  • JsDoc to PDF renderer
  • How to customise Gauge needle pointer using D3.js or C3.js?
  • javascript: Calculate difference between two dates
  • Pass dropdown value to code behind static method
  • In C++, is it possible to have a class inherit from one of its member classes?
  • XPath evaluate a query and retrieve the value
  • .cs files can't be opened anymore in VS 2013 Professional Update 4
  • Replacing the Translator service in Symfony 3
  • What happens when I divide by zero?
  • How to make a multi-line textbox's tab-stop working in win32api application
  • Server timezone offset value
  • Faster way to compute likelihood of sequence?
  • Perform a conditional operation on a pandas column
  • Active Directory Account Password Expiration Date with Fine-Grain Password Policy
  • Why do class member functions defined outside the class (but in header file) have to be inlined?
  • SpringData MongoDB cannot determine IsNewStrategy during Auditing
  • The exact mechanism of mapping WhyML into SMT logic
  • in x64 Windows is there a way to run a Runtime.exec() process avoiding 'Registry redirection�
  • Render face of cube map to a quad
  • How can I migrate my WP8 application to universal when it uses a local linq to sql db?
  • When does initialisation of global variables happen?
  • What is the “center” of a Three.js object?
  • Copying different structs to byte arrays
  • Best method for parsing date formats during import datas
  • How to install and setup Testswarm?
  • Moving Object in ScrollViewer
  • How to setCenter mapview with location in google maps sdk for iOS
  • How can i find the size of a dynamically allocated array in C?
  • How do I specify custom wording in a will_paginate view helper?
  • Sorting parallel arrays in javascript
  • How to specify columns in Swagger
  • Generic collection in generic class
  • What's a better way to swap two argument values?
  • AWS S3 implementation for serving private files
  • How to use a decaying learning rate with an estimator in tensorflow?
  • Detect when user takes a screen shot in windows with print screen?
  • How to parse Java properties which contains variables?
  • What are the use cases of dlopen vs standard dynamic linking?
  • Getting coordinates of a component in java
  • Connect Node.js with Oracle on Windows platform
  • d3.js: why is d3.geo.path() giving NaN?
  • Extending the Django 1.11 User Model
  • Watson Conversation - Why is the ANYTHING ELSE node not chosen
  • Cross platform UI spacing/padding
  • Less Conflicting Session Manager for Zope 2
  • Retaining data after updating application
  • Request response issues in biztalk
  • Extract All Possible Paths from Expression-Tree and evaluate them to hold TRUE
  • How to handle elastic beanstalk deployment so it uploads only changed files
  • How to pass a value from ASP.NET MVC controller to ASP.NET webforms control inside MVC View?
  • uml Composition relationships to RDF and OWL
  • Ubuntu and bcrypt
  • Filter strings with regex before casting to numeric
  • JPA flush vs commit
  • How to detect interior vertices in groups of 2d polygons? (E.g. ZIP Codes to determine a territory)
  • Elasticsearch script query involving root and nested values
  • Django model inheritance, filtering models
  • Why use database factory in asp.net mvc?
  • Low TTL with Leveled Compaction, should I reduce gc_grace_seconds to improve read performance withou
  • How do I configure context broker accept post requests from my remote sensor?
  • D3 get axis values on zoom event
  • x64 applications using gdi+: what are the consequences on performance?
  • How to define custom class, title, and target in Link Browser for content elements and the new rte_c
  • Admob requires api-13 or later can I not deploy on old API-8 phones?
  • jQuery .attr() and value
  • Using $this when not in object context
  • Splitting given String into two variables - php
  • Javascript Callbacks with Object constructor
  • 0x202A in filename: Why?
  • Angular 2 constructor injection vs direct access
  • How does Linux kernel interrupt the application?